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Abstract 

A detailed description is reported of the analysis used by the CMS Collaboration in 
the search for the standard model Higgs boson in pp collisions at the LHC, which led 
to the observation of a new boson. The data sample corresponds to integrated lumi- 
nosities up to 5.1 fb^^ at a/s = 7TeV, and up to 5.3 fb^^ at a/s = 8 TeV. The results 
for five Higgs boson decay modes 77, ZZ, WW, tt, and bb, which show a combined 
local significance of 5 standard deviations near 125 GeV, are reviewed. A fit to the 
invariant mass of the two high resolution channels, 77 and ZZ — )■ Ai, gives a mass es- 
timate of 125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV. The measurements are interpreted in the 
context of the standard model Lagrangian for the scalar Higgs field interacting with 
fermions and vector bosons. The measured values of the corresponding couplings 
are compared to the standard model predictions. The hypothesis of custodial sym- 
metry is tested through the measurement of the ratio of the couplings to the W and Z 
bosons. All the results are consistent, within their uncertainties, with the expectations 
for a standard model Higgs boson. 
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1 Introduction 

The standard model (SM) HHSl of particle physics accurately describes many experimental re- 
sults that probe elementary particles and their interactions up to an energy scale of a few hun- 
dred GeV [4]. In the SM, the building blocks of matter, the fermions, are comprised of quarks 
and leptons. The interactions are mediated through the exchange of force carriers: the photon 
for electromagnetic interactions, the W and Z bosons for weak interactions, and the gluons for 
strong interactions. All the elementary particles acquire mass through their interaction with 
the Higgs field [S-jlSJ. This mechanism, called the "Higgs" or "BEH" mechanism l|5l-[T0|, is the 
first coherent and the simplest solution for giving mass to W and Z bosons, while still preserv- 
ing the symmetry of the Lagrangian. It is realized by introducing a new complex scalar field 
into the model. By construction, this field allows the W and Z bosons to acquire mass whilst 
the photon remains massless, and adds to the model one new scalar particle, the SM Higgs 
boson (H). The Higgs scalar field and its conjugate can also give mass to the fermions, through 
Yukawa interactions IITT] - [T3ll . The SM does not directly predict the values of the masses of the 
elementary particles, and in the same context there is no prediction for the Higgs boson mass. 
The particle masses are considered parameters to be determined experimentally. Nevertheless, 
a number of very general arguments [14-17] have been used to narrow the range of possible 
values for the Higgs boson mass to below approximately 1 TeV. The wealth of electroweak 
precision data from the LEP and SLC colliders, the Tevatron, and other experiments predicted 
the Higgs boson mass to be at approximately 90 GeV, with an upper limit of 152 GeV at the 
95% confidence level (CL) [4J. Direct searches at LEP excluded values lower than 114.4 GeV at 
95% CL IITSlI , and early Tevatron measurements excluded the mass range 162-166 GeV at 95% 
CLim. 

The discovery or exclusion of the SM Higgs boson is one of the primary scientific goals of the 
LHC. Previous direct searches at the LHC were based on data from proton-proton collisions 
corresponding to an integrated luminosity of 5.1 fb^^ collected at a centre-of-mass energy of 
7 TeV. The CMS experiment excluded at 95% CL masses from 127 to 600 GeV [20J. The ATLAS 
experiment excluded at 95% CL the ranges 111.4-116.4, 119.4-122.1, and 129.2-541 GeV ll2TI . 
Within the remaining allowed mass region, an excess of events between 2 and 3 standard devi- 
ations (cr) near 125 GeV was reported by both experiments. In 2012, the proton-proton centre-of- 
mass energy was increased to 8 TeV, and by the end of June, an additional integrated luminosity 
of more than 5.3 fb^^ had been recorded by each of the two experiments, thereby enhancing sig- 
nificantly the sensitivity of the search for the Higgs boson. The result was the observation by the 
ATLAS and CMS Collaborations of a new heavy boson with a mass of approximately 125 GeV. 
The two experiments simultaneously published the observation in concise papers p2ll23l . The 
CMS publication Il23l focused on the observation in the five main decay channels in the low- 
mass range from 110 to 145 GeV: H ^ 77, H ^ ZZ ^ 4£, H ^ WW iviv, H tt, and 
H — )■ bb, where I stands for electron or muon, and for simplicity our notation does not distin- 
guish between particles and antiparticles. In the summer 2012 the analysis of the full data set 
by the CDF and DO Collaborations resulted in an excess of events of about 3c in the mass range 
120 < /Wh ^ 135 GeV, while searching for a SM Higgs boson decaying into b quarks ||24| . 

The channels with the highest sensitivity for discovering the SM Higgs boson with a mass 
near 125 GeV are H — > 77 and H — > ZZ — > 4£. The other three final states have poorer mass 
resolution and, therefore, necessitate more data to achieve a similar sensitivity. Among them, 
the H — >• WW — > iviv channel has the largest signal-to-background ratio. These five channels 
are complementary in the way they are measured in the detector, as is the information they can 
provide about the SM Higgs boson. 
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A light Higgs boson has a natural width of a few MeV Il25ll , and therefore the precision of the 
mass measurement from fully reconstructed decays would be limited by the detector resolu- 
tion. The first two channels, H — )■ 77 and H — ?► ZZ — ?► A£, produce a narrow mass peak. 
These two high-resolution channels were used to measure the mass of the newly observed 
particle |I22H23|. 

In the SM, the properties of the Higgs boson are fully determined once its mass is known. All 
cross sections and decay fractions are predicted ||25H26l, and thus the measured rates into each 
channel provide a test of the SM. The individual measurements can be combined, and from 
them the coupling constants of the Higgs boson with fermions and bosons can be extracted. 
The measured values can shed light on the nature of the newly observed particle because the 
Higgs boson couplings to fermions are qualitatively different from those to bosons. 

The data described in this paper are identical to those reported in the observation publica- 
tion [23 J. The main focus of this paper is an in-depth description of the five main analyses and 
a more detailed comparison of the various channels with the SM predictions by evaluating the 
couplings to fermions and vector bosons, as well as various coupling ratios. 

The paper is organized into several sections. Sections 2 and 3 contain a short description of 
the CMS detector and the event reconstruction of physics objects relevant for the Higgs bo- 
son search. Section 4 describes the data sample, the Monte Carlo (MC) event generators used 
for the signal and background simulation, and the evaluation of the signal sensitivity. Then 
the analyses of the five decay channels are described in detail in Sections 5 to 9. In the last 
section, the statistical method used to combine the five channels and the statistical treatment 
of the systematic uncertainties are explained. Finally, the results are combined and the first 
measurements of the couplings of the new particle to bosons and fermions are presented. 

2 The CMS experiment 

The discovery capability for the SM Higgs boson is one of the main benchmarks that went into 
optimizing the design of the CMS experiment p/HSOt . 

The central feature of the detector fSOl is a superconducting solenoid 13 m long, with an inter- 
nal diameter of 6 m. The solenoid generates a uniform 3.8 T magnetic field along the axis of the 
LHC beams. Within the field volume are a silicon pixel and strip tracker, a lead tungstate crys- 
tal electromagnetic calorimeter (ECAL), and a brass /scintillator hadron calorimeter (HCAL). 
Muons are identified and measured in gas-ionization detectors embedded in the outer steel 
magnetic flux return yoke of the solenoid. The detector is subdivided into a cylindrical barrel 
and endcap disks on each side of the interaction point. Forward calorimeters complement the 
coverage provided by the barrel and endcap detectors. 

The CMS experiment uses a right-handed coordinate system, with the origin at the nominal in- 
teraction point, the x axis pointing to the centre of the LHC, the y axis pointing up (perpendic- 
ular to the LHC plane), and the z axis along the anticlockwise-beam direction. The azimuthal 
angle (p is measured in the x-y plane. The pseudorapidity is defined as // = — ln[tan {6/2)] 
where the polar angle 9 is measured from the positive z axis. The centre-of-mass momentum 
of the colliding partons in a proton-proton collision is subject to Lorentz boosts along the beam 
direction relative to the laboratory frame. Because of this effect, the pseudorapidity, rather 
than the polar angle, is a more natural measure of the angular separation of particles in the rest 
frame of the detector. 

Charged particles are tracked within the pseudorapidity range < 2.5. The silicon pixel 
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tracker is composed of 66 million pixels of area 100 x 150 f/m^, arranged in three barrel layers 
and two endcap disks at each end. The silicon strip tracker, organized in ten barrel layers 
and twelve endcap disks at each end, is composed of 9.3 million strips with pitch between 
80 and 205 /^m, with a total silicon surface area of 198 m^. The performance of the tracker 
is essential to most analyses in CMS and has reached the design performance in transverse- 
momentum (px) resolution, efficiency, and primary- and secondary-vertex resolutions. The 
tracker has an efficiency larger than 99% for muons with pj > 1 GeV, a pj resolution between 2 
and 3% for charged tracks of pj ~ 100 GeV in the central region {\t]\ < 1 .5), and unprecedented 
capabilities for b-jet identification. Measurements of the impact parameters of charged tracks 
and secondary vertices are used to identify jets that are likely to contain the hadronization and 
decay products of b quarks ("b jets"). A b-jet tagging efficiency of more than 50% is achieved 
with a rejection factor for light-quark jets of ~200, as measured with tt events in data (311. 
The dimuon mass resolution at the Y mass, dominated by instrumental effects, is measured 
to be 0.6% in the barrel region |32|, consistent with the design goal. Due to the high spatial 
granularity of the pixel detector, the channel occupancy is less than lO^'^, allowing charged- 
particle trajectories to be measured in the high-rate environment of the LHC without loss of 
performance. 

The ECAL is a fine-grained, homogeneous calorimeter consisting of more than 75000 lead 
tungstate crystals, arranged in a quasi-projective geometry and distributed in a barrel region 
(I?/ 1 < 1.48) and two endcaps that extend up to \rj\ = 3.0. The front-face cross section of the 
crystals is approximately 22 x 22 mm'^ in the barrel region and 28.6 x 28.6 mm^ in the end- 
caps. Preshower detectors consisting of two planes of silicon sensors interleaved with a total 
of three radiation lengths of lead absorber are located in front of the endcaps. Electromagnetic 
(EM) showers are narrowly distributed in the lead tungstate crystals (Moliere radius of 21 mm), 
which have a transverse size comparable to the shower radius. The precise measurement of the 
transverse shower shape is the primary method used for EM particle identification, and mea- 
surements in the surrounding crystals are used for isolation criteria. The energy resolution of 
the ECAL is the single most important performance benchmark for the measurement of the 
Higgs boson decay into two photons and to a lesser extent for the decay to ZZ that subse- 
quently decay to electrons. In the central barrel region, the energy resolution of electrons that 
interact minimally with the tracker material indicates that the resolution of unconverted pho- 
tons is consistent with design goals. The energy resolution for photons with transverse energy 
of ~60GeV varies between 1.1% and 2.5% over the solid angle of the ECAL barrel, and from 
2.2% to 5% in the endcaps. For ECAL barrel unconverted photons the diphoton mass resolution 
is estimated to be 1.1 GeV at a mass of 125 GeV. 

The HC AL barrel and endcaps are sampling calorimeters composed of brass and plastic scintil- 
lator tiles, covering < 3.0. The hadron calorimeter thickness varies from 7 to 11 interaction 
lengths within the solenoid, depending on \t]\; a scintillator "tail catcher" placed outside the 
coil of the solenoid, just in front of the innermost muon detector, extends the instrumented 
thickness to more than 10 interaction lengths. Iron forward calorimeters with quartz fibres, 
read out by photomultipliers, extend the calorimeter coverage up to \ri\ = 5.0. 

Muons are measured in the range < 2.4, with detection planes based on three technologies: 
drift tubes (\rj\ < 1.2), cathode strip chambers (0.9 < \ri\ < 2.4), and resistive-plate chambers 
(|)/| < 1.6). The first two technologies provide a precise position measurement and trigger, 
whilst the third one provides precise timing information, as well as a second independent trig- 
ger. The muon system consists of four stations in the barrel and endcaps, designed to ensure 
robust triggering and detection of muons over a large angular range. In the barrel region, each 
muon station consists of twelve drift-tube layers, except for the outermost station, which has 
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eight layers. In the endcaps, each muon station consists of six detection planes. The precision 
of the r-(p measurement is 100 jim in the drift tubes and varies from 60 to 140 }im in the cathode 
strip chambers, where r is the radial distance from the beamline and (p is the azimuthal angle. 

The CMS trigger and data acquisition systems ensure that data samples with potentially in- 
teresting events are recorded with high efficiency. The first-level (LI) trigger, composed of the 
calorimeter, muon, and global-trigger processors, uses coarse-granularity information to select 
the most interesting events in less than 4 jis. The detector data are pipelined to ensure negligible 
deadtime up to a LI rate of 100 kHz. After LI triggering, data are transferred from the read- 
out electronics of all subdetectors through the readout network to the high-level-trigger (HLT) 
processor farm, which assembles the full event and executes global reconstruction algorithms. 
The HLT filters the data, resulting in an event rate of ~500 Hz stored for offline processing. 

All data recorded by the CMS experiment are accessible for offline analysis through the world- 
wide LHC computing grid. The CMS experiment employs a highly distributed computing 
infrastructure, with a primary Tier-0 centre at CERN, supplemented by seven Tier-1, more than 
50 Tier-2, and over 100 Tier-3 centres at national laboratories and universities throughout the 
world. The CMS software running on this high-performance computing system executes a 
multitude of crucial tasks, including the reconstruction and analysis of the collected data, as 
well as the generation and detector modelling of MC simulation. 



3 Event reconstruction 

Figure [l] shows the distribution of the number of vertices reconstructed per event in the 2011 
and 2012 data, and the display of a four-lepton event recorded in 2012. The large number of 
proton-proton interactions occurring per LHC bunch crossing ("pileup"), on average of 9 in 
2011 and 19 in 2012, makes the identification of the vertex corresponding to the hard-scattering 
process nontrivial, and affects most of the physics objects: jets, lepton isolation, etc. The track- 
ing system is able to separate collision vertices as close as 0.5 mm along the beam direction [33l. 
For each vertex, the sum of the pj of all tracks associated with the vertex is computed. The 
vertex for which this quantity is the largest is assumed to correspond to the hard-scattering 
process, and is referred to as the primary vertex in the event reconstruction. In the H — > 77 
final state, a large fraction of the transverse momentum produced in the collision is carried by 



the photons, and a dedicated algorithm, described in Section 5.2 is therefore used to assign the 
photons to a vertex. 

A particle-flow (PF) algorithm ||34ll35l combines the information from all CMS subdetectors to 
identify and reconstruct the individual particles emerging from all vertices: charged hadrons, 
neutral hadrons, photons, muons, and electrons. These particles are then used to reconstruct 
the missing transverse energy, jets, and hadronic r-lepton decays, and to quantify the isolation 
of leptons and photons. 

Electrons and photons can interact with the tracker material before reaching the ECAL to create 
additional electrons and photons through pair production and bremsstrahlung radiation. A 
calorimeter superclustering algorithm is therefore used to combine the ECAL energy deposits 
that could correspond to a photon or electron. In the barrel region, superclusters are formed 
from five-crystal-wide areas in rj, centred on the locally most-energetic crystal and having a 
variable extension in (p. In the endcaps, where the crystals are arranged according to an x- 
y rather than rj-(p geometry, matrices of 5 x 5 crystals around the most-energetic crystals are 
merged if they lie within a narrow road in r]. 
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Figure 1: Left: probability distribution for the number of vertices Nvertices reconstructed per 
event in the 2011 and 2012 data. The a/s = 7 and 8TeV probability distributions are weighted 
by their equivalent integrated luminosity, and by the corresponding total cross section cr(pp — )■ 
H + X) for a SM Higgs boson of mass 125 GeV. Right: display of a four-lepton event recorded 
in 2012, with 24 reconstructed vertices. The four leptons are shown as thick lines and originate 
from the vertex chosen for the hard-scattering process. 

The stability and uniformity of the ECAL response must be calibrated at a fraction of a percent 
to maintain the excellent intrinsic energy resolution of the ECAL |36|. A dedicated monitoring 
system, based on the injection of laser light into each crystal, is used to track and correct for 
channel response changes caused by radiation damage and subsequent recovery of the crys- 
tals [37 1 . Response variations are a few percent in the barrel region, and increase up to a few 
tens of percent in the most-forward endcap regions. The channel-to-channel response is equal- 
ized using several techniques that exploit reference signatures from collision events (mainly 
Tz^,rj — ?► 77) [38j. The residual miscalibration of the channel response varies between 0.5% in 
the central barrel to a few percent in the endcaps [39 J. At the reconstruction level, additional 
correction factors to the photon energy are applied. These corrections are sizeable for photons 
that convert before entering the ECAL, for which the resolution is mainly limited by shower- 
loss fluctuations. Given the distribution of the tracker material in front of the ECAL, these 
effects are sizeable for I)/] > 1 |39|. 

Candidate photons for the H — > 77 search are reconstructed from the superclusters, and their 



identification is discussed in Section 5.3 The photon energy is computed starting from the raw 
supercluster energy. In the region covered by the preshower detector {\ri\ > 1.65), the energy 
recorded in that detector is added. In order to obtain the best resolution, the raw energy is 
corrected for the containment of the shower in the clustered crystals and for the shower losses 
of photons that convert in the tracker material before reaching the calorimeter. These correc- 
tions are computed using a multivariate regression technique based on the boosted decision 
tree (BDT) implementation in TMVA [40]. The regression is trained on photons from a sample 
of simulated events using the ratio of the true photon energy to the raw energy as the target 
variable. The input variables are the rj and (p coordinates of the supercluster, a collection of 
shower-shape variables, and a set of energy-deposit coordinates defined with respect to the 
supercluster. A second BDT, using the same input variables, is trained on a separate sample of 



6 



3 Event reconstruction 



simulated photons to provide an estimate of the uncertainty in the energy value provided by 
the first BDT. 

The width of the reconstructed Z resonance is used to quantify the ECAL performance, us- 
ing decays to two electrons whose energies are measured using the ECAL alone, with their 
direction determined from the tracks. In the 7 TeV data set, the dielectron mass resolution at 
the Z boson mass, fitting for the measurement contribution separately from the natural width, 
is 1.56 GeV in the barrel and 2.57 GeV in the endcaps, while in the 8 TeV data sample, recon- 
structed with preliminary calibration constants, the corresponding values are 1.61 GeV and 
3.75 GeV. 

Electron reconstruction is based on two methods: the first where an ECAL supercluster is used 
to seed the reconstruction of a charged-particle trajectory in the tracker [41 , 42], and the sec- 
ond where a candidate track is used to reconstruct an ECAL supercluster ||43| . In the latter, 
the electron energy deposit is found by extrapolating the electron track to the ECAL, and the 
deposits from possible bremsstrahlung photons are collected by extrapolating a straight line 
tangent to the electron track from each tracker layer, around which most of the tracker mate- 
rial is concentrated. In both cases, the trajectory is fitted with a Gaussian sum filter 11441 using 
a dedicated modelling of the electron energy loss in the tracker material. Merging the out- 
put of these two methods provides high electron reconstruction efficiency within \t]\ < 2.5 
and Pj > 2 GeV. The electron identification relies on a tmva BDT that combines observables 
sensitive to the amount of bremsstrahlung along the electron trajectory, the geometrical and 
momentum matching between the electron trajectory and the associated supercluster, as well 
as the shower-shape observables. 

Muons are reconstructed within \ri\ < 2.4 and down to a pj of 3 GeV. The reconstruction com- 
bines the information from both the silicon tracker and the muon spectrometer. The matching 
between the tracker and the muon system is initiated either "outside-in", starting from a track 
in the muon system, or "inside-out", starting from a track in the silicon tracker. Loosely identi- 
fied muons, characterized by minimal requirements on the track components in the muon sys- 
tem and taking into account small energy deposits in the calorimeters that match to the muon 
track, are identified with an efficiency close to 100% by the PF algorithm. In some analyses, 
additional tight muon identification criteria are applied: a good global muon-track fit based on 
the tracker and muon chamber hits, muon track-segment reconstruction in at least two muon 
stations, and a transverse impact parameter with respect to the primary vertex smaller than 
2 mm. 

Jets are reconstructed from all the PF particles using the anti-A;x jet algorithm [45^| implemented 
in FASTJET ||46| , with a distance parameter of 0.5. The jet energy is corrected for the contribu- 
tion of particles created in pileup interactions and in the underlying event. This contribution 
is calculated as the product of the jet area and an event-by-event pj density p, also obtained 
with FASTJET using all particles in the event. Charged hadrons, photons, electrons, and muons 
reconstructed by the PF algorithm have a calibrated momentum or energy scale. A residual 
calibration factor is applied to the jet energy to account for imperfections in the neutral-hadron 
calibration, the jet energy containment, and the estimation of the contributions from pileup 
and underlying-event particles. This factor, obtained from simulation, depends on the jet pj 
and f], and is of the order of 5% across the whole detector acceptance. Finally, a percent-level 
correction factor is applied to match the jet energy response in the simulation to the one ob- 
served in data. This correction factor and the jet energy scale uncertainty are extracted from a 
comparison between the data and simulation of 7+jets, Z+jets, and dijet events [47J. Particles 
from different pileup vertices can be clustered into a pileup jet, or significantly overlap a jet 
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from the primary vertex below the pj threshold applied in the analysis. Such jets are identified 
and removed using a TMVA BDT with the following input variables: momentum and spatial 
distribution of the jet particles, charged- and neutral-particle multiplicities, and consistency of 
charged hadrons within the jet with the primary vertex. 

The missing transverse energy (£^'^^) vector is calculated as the negative of the vectorial sum 
of the transverse momenta of all particles reconstructed by the PF algorithm. The resolution 
^(E^^^) on either the x or y component of the E^®* vector is measured in Z — > /^/^ events 
and parametrized by c(E^**) = 0.5 x ^/'LEj, where EEj is the scalar sum of the transverse 
momenta of all particles, with cr and SEj expressed in GeV. In 2012, with an average number 
of 19 pileup interactions, SEj ~ 600 GeV for the analyses considered here. 

Jets originating from b-quark hadronization are identified using different algorithms that ex- 
ploit particular properties of such objects ||3T]| . These properties, which result from the rela- 
tively large mass and long lifetime of b quarks, include the presence of tracks with large impact 
parameters, the presence of secondary decay vertices displaced from the primary vertex, and 
the presence of low-pj leptons from semileptonic b-hadron decays embedded in the jets [|3T|. 
A combined secondary-vertex (CSV) b-tagging algorithm, used in the H — > bb and H — > tt 
searches, makes use of the information about track impact parameters and secondary vertices 
within jets in a likelihood discriminant to provide separation of b jets from jets originating from 
gluons, light quarks, and charm quarks. The efficiency to tag b jets and the rate of misidentifi- 
cation of non-b jets depends on the algorithm used and the operating point chosen. These are 
typically parameterized as a function of the transverse momentum and rapidity of the jets. The 
performance measurements are obtained directly from data in samples that can be enriched in 
b jets, such as tt and multijet events. 

Hadronically decaying t leptons (t;,) are reconstructed and identified using an algorithm |j48l 
which targets the main decay modes by selecting candidates with one charged hadron and 
up to two neutral pions, or with three charged hadrons. A photon from a neutral-pion decay 
can convert in the tracker material into an electron and a positron, which can then radiate 
bremsstrahlung photons. These particles give rise to several ECAL energy deposits at the same 
Tj value and separated in azimuthal angle, and are reconstructed as several photons by the 
PF algorithm. To increase the acceptance for such converted photons, the neutral pions are 
identified by clustering the reconstructed photons in narrow strips along the (p direction. The T;, 
from W, Z, and Higgs boson decays are typically isolated from the other particles in the event, 
in contrast to misidentified from jets that are surrounded by the jet particles not used in the 
T;, reconstruction. The Th isolation parameter RJ^^ is obtained from a multivariate discriminator, 
taking as input a set of transverse momentum sums Sj = Pxi,], where pjjj is the transverse 
momentum of a particle i in a ring j centred on the T/, candidate direction and defined in [t], (p) 
space. Five equal-width rings are used up to a distance AR = ^y (A?/)^ + (A^)^ = 0.5 from 
the T}j candidate, where Ai] and Acp are the pseudorapidity and azimuthal angle differences (in 
radians), respectively, between the particle and the T;, candidate direction. The effect of pileup 
on the isolation parameter is mainly reduced by discarding from the Sj calculation the charged 
hadrons with a track originating from a pileup vertex. The contribution of pileup photons and 
neutral hadrons is handled by the discriminator, which also takes as input the pj density p. 

The isolation parameter of electrons and muons is defined relative to their transverse momen- 
tum pj as 
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Table 1: Summary information on the analyses included in this paper. The column "H prod." 
indicates the production mechanism targeted by an analysis; it does not imply 100% purity 
(e.g. analyses targeting vector-boson fusion (VBF) production are expected to have 30%-50% 
of their signal events coming from gluon-gluon fusion production). The main contribution 
in the untagged and inclusive categories is always gluon-gluon fusion. A final state can be 
further subdivided into multiple categories based on additional jet multiplicity reconstructed 
mass, transverse momentum, or multivariate discriminators. Notations used are: (jj)vBF stands 
for a dijet pair consistent with topology (VBF-tag); V = Wf and Z bosons; same flavour (SF) 
dileptons = ee or fijj. pairs; different flavour (DF) dileptons = e}i pairs; T;, = t leptons decaying 
hadronically. FH stands for associated production with a vector boson. 



H decay 


H prod. 


Exclusive final states 


No. of 


fWH range 












analysed 


channels 


(GeV) 


resolution 


7 TeV 


8 TeV 




untagged 


4 diphoton categories 


4 


110-150 


1-2% 


5.1 


5.3 


77 


VBF-tag 


77 + (jj)vBF 

2 mjj categories for 8 TeV 


lor 2 


110-150 


1-2% 


5.1 


5.3 


ZZ^4£ 


inclusive 


4e, 4^, le2}i 


3 


110-180 


1-2% 


5.0 


5.3 




or 1 jet 


DF or SF dileptons 


4 


110-160 


20% 


4.9 


5.1 


WW iviv 


VBF-tag 


(jj)vBF 

DF or SF dileptons for 8 TeV 


lor 2 


110-160 


20% 


4.9 


5.1 


TT 


or 1 jet 


(eT,„ fiT,„ e}i, nil) 

2 categories and or 1 jet 


16 


110-145 


20% 


4.9 


5.1 


VBF-tag 


(eTft, fiT,„ en, nn) + (jj)vBF 


4 


110-145 


20% 


4.9 


5.1 


hh 


VH-tag 


{yv, ee, nn, ev, piv + 2h jets) 
2 pj categories 


10 


110-135 


10% 


5.0 


5.1 



where I^charged Pt, Eneut. had. Pt, arid pj are, respectively, the scalar sums of the transverse 
momenta of charged hadrons, neutral hadrons, and photons located in a cone centred on the 
lepton direction in (?/, (^) space. The cone size AR is taken to be 0.3 or 0.4 depending on the 
analysis. Charged hadrons associated with pileup vertices are not considered, and the contri- 
bution of pileup photons and neutral hadrons is estimated as the product of the neutral-particle 
Pt density pneutrai arid an effective cone area Agff. The neutral-particle pj density is obtained 
with FASTJET using all PF photons and neutral hadrons in the event, and the effective cone area 
is slightly different from the actual cone area, being computed in such a way so as to absorb 
the residual dependence of the isolation efficiency on the number of pileup collisions. 

4 Data sample and analyses performance 

The data have been collected by the CMS experiment at a centre-of-mass energy of 7 TeV in the 
year 2011, corresponding to an integrated luminosity of about 5.1 fb^^, and a centre-of-mass 
energy of 8 TeV in the year 2012, corresponding to an integrated luminosity of about 5.3 fb^^. 

A summary of all analyses described in this paper is presented in Table [ij where we list their 
main characteristics, namely: exclusive final states, Higgs boson mass range of the search, in- 
tegrated luminosity used, and the approximate experimental mass resolution. The presence of 
a signal in one of the channels at a certain value of the Higgs boson mass, jwh/ should mani- 
fest itself as an excess in the corresponding invariant-mass distribution extending around that 
value for a range corresponding to the mn resolution. 
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4.1 Simulated samples 

MC simulation samples for the SM Higgs boson signal and background processes are used to 
optimize the event selection, evaluate the acceptance and systematic uncertainties, and predict 
the expected yields. They are processed through a detailed simulation of the CMS detector 
based on Geant4 [49J and are reconstructed with the same algorithms used for the data. The 
simulations include pileup interactions properly reweighted to match the distribution of the 
number of such interactions observed in data. For leading-order generators the default set 
of parton distribution functions (PDF) used to produce these samples is CTEQ6L [50 1, while 
CTIO |51| is employed for next-to-leading-order (NLO) generators. For all generated samples 
the hadronization is handled by pythia 6.4 1521 or HERWIG++ ||53|, and the tauola ||54l 
package is used for t decays. The pythia parameters for the underlying event and pileup 
interactions are set to the Z2 tune [55] for the 7 TeV data sample and to the Z2* tune |55|| for the 
8 TeV data sample. 

4.2 Signal simulation 

The Higgs boson can be produced in pp collisions via four different processes: gluon-gluon 
fusion, vector-boson fusion, associated production with a vector boson, and associated pro- 
duction with a tt pair. Simulated Higgs boson signals from gluon-gluon fusion (gg — )■ H), and 
vector-boson fusion (VBF) (qq qqH), are generated with powheg I|56ti58l at NLO. The sim- 
ulation of associated-production samples uses pythia, with the exception of the H — )■ bb anal- 
ysis that uses POWHEG interfaced to HERWIG++. Events at the generator level are reweighted 
according to the total cross section c^(pp — > H), which contains contributions from gluon-gluon 
fusion up to next-to-next-to-leading order (NNLO) and next-to-next-to-leading-log (NNLL) 
terms p5l |59] - [73| , vector-boson fusion including NNLO quantum chromodynamic (QCD) and 
NLO electroweak (EW) terms [25HM178J, associated production VH (where V = Z,W) at 
NNLO QCD and NLO EW [,79rr84l, and the production in association with tt at NLO QCD [,85^ 

ESI. 

For the four-fermion final states the total cross section is scaled by the branching fraction 
B{H -> U) calculated with the prophecy4f program ||89l|90|. The calculations include NLO 
QCD and EW corrections, and all interference effects up to NLO [25} |26l |89}H|. For all the 
other final states hdecay |l9lH92l is used, which includes NLO QCD and NLO EW corrections. 
The predicted signal cross sections at 8 TeV and branching fraction for a low-mass Higgs boson 
are shown in the left and right plots of Fig.|2| respectively Il25ll26l| . 

The uncertainty in the signal cross section related to the choice of PDFs is determined with the 
PDF4LHC prescription ||95ti99l. The uncertainty due to the higher-order terms is calculated by 
varying the renormalization and factorization scales in each process, as explained in Ref. [25|. 

For the dominant gluon-gluon fusion process, the transverse momentum spectrum of the Higgs 
boson in the 7 TeV MC simulation samples is reweighted to match the NNLL + NLO distribu- 
tion computed with HqT BlOOl IIOII (and fehipro II102[ 11031 for the high-px range in the tt 
analysis), except in the H — ?► ZZ analysis, where the reweighting is not necessary. At 8 TeV, 
POWHEG was tuned to reach a good agreement of the pj spectrum with the NNLL + NLO 
prediction in order to make reweighting unnecessary |[26l. 

4.3 Background simulation 

The background contribution from ZZ production via qq is generated at NLO with POWHEG, 
while other diboson processes (WW, WZ) are generated with MadGraph |104[ |1051 with cross 
sections rescaled to NLO predictions. The pythia generator is also used to simulate all diboson 
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Figure 2: Higgs boson production cross sections at = 8 TeV (left) and branching fractions 
(right) as a function of the Higgs boson mass from Refs. I|25ll26| . The width of the lines repre- 
sents the total theoretical uncertainty in the cross section and in the branching fractions. 



processes. The gg — >VV contributions are generated with GG2vv II106L The V+jets and V7 
samples are generated with MadGraph, as are contributions to inclusive Z and W production, 
with cross sections rescaled to NNLO predictions. Single-top-quark and tt events are generated 
at NLO with POWHEG. The PYTHIA generator takes into account the initial-state and final-state 
radiation effects that can lead to the presence of additional hard photons in an event. The 
MadGraph generator is also used to generate samples of tt events. QCD events are generated 
with PYTHIA. Table |2] summarizes the generators used for the different analyses. 

4.4 Search sensitivities 

The search sensitivities of the different channels, for the recorded luminosity used in the analy- 
ses, expressed in terms of the median expected 95% CL upper limit on the ratio of the measured 
signal cross section, a, and the predicted SM Higgs boson cross section, asm, are shown in Fig. |3] 
(left) as a function of the Higgs boson mass. A channel showing values below unity (dashed 
horizontal line) for a given mass hypothesis would be expected, in the absence of a Higgs boson 
signal, to exclude the standard model Higgs boson at 95% CL or more at that mass. Figure |3] 
(right) shows the expected sensitivities for the observation of the Higgs boson in terms of local 
p-values and significances as a function of the Higgs boson mass. The local p-value is defined 
as the probability of a background fluctuation; it measures the consistency of the data with the 
background-only hypothesis. 

The overall statistical methodology used in this paper was developed by the ATLAS and CMS 
Collaborations in the context of the LHC Higgs Combination Group II107I . A summary of our 



usage of this methodology in the search for the Higgs boson is given in Section 10 



5 H ^ 77 

In the H — ?► 77 analysis, a search is made for a narrow peak, of width determined by the ex- 
perimental resolution of ~1%, in the diphoton invariant-mass distribution for the range 110- 
150 GeV, on top of a large irreducible background from the production of two photons originat- 
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Table 2: Summary of the generators used for the simulation of the main backgrounds for the 
analyses presented in this paper. 
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Figure 3: The median expected 95% CL upper limits on the cross section ratio a/ a^u in the 
absence of a Higgs boson (left) and the median expected local p-value for observing an excess, 
assuming that a Higgs boson with that mass exists (right), as a function of the Higgs boson 
mass for the five Higgs boson decay channels and their combination. 



12 



5 H — )• 77 



ing directly from the hard-scattering process. In addition, there is a sizable amount of reducible 
background in which one or both of the reconstructed photons originate from the misidentifica- 
tion of particles in jets that deposit substantial energy in the ECAL, typically photons from the 
decay of or // mesons. Early studies indicated this to be one of the most promising channels 
in the search for a SM Higgs boson in the low-mass range II108L 

To enhance the sensitivity of the analysis, candidate diphoton events are separated into mutu- 
ally exclusive classes with different expected signal-to-background ratios, based on the prop- 
erties of the reconstructed photons and the presence or absence of two jets satisfying criteria 
aimed at selecting events in which a Higgs boson is produced through the VBF process. The 
analysis uses multivariate techniques for the selection and classification of the events. As in- 
dependent cross-checks, two additional analyses are performed. The first is almost identical to 
the CMS analysis described in Ref. [109J, but uses simpler criteria based on the properties of 
the reconstructed photons to select and classify events. The second analysis incorporates the 
same multivariate techniques described here, however, it relies on a completely independent 



modelling of the background. These two analyses are described in more detail in Section 5.6 

5.1 Diphoton trigger 

All the data under consideration have passed at least one of a set of diphoton triggers, each 
using transverse energy thresholds and a set of additional photon selections, including criteria 
on the isolation and the shapes of the reconstructed energy clusters. The transverse energy 
thresholds were chosen to be at least 10% lower than the envisaged final-selection thresholds. 
This set of triggers enabled events passing the later offline H — > 77 selection criteria to be 
collected with a trigger efficiency greater than 99.5%. 

5.2 Interaction vertex location 

In order to construct a photon four-momentum from the measured ECAL energies and the im- 
pact position determined during the supercluster reconstruction, the photon production vertex, 
i.e. the origin of the photon trajectory, must be determined. Without incorporating any addi- 
tional information, any of the reconstructed pp event vertices is potentially the origin of the 
photon. If the distance in the longitudinal direction between the assigned and the true inter- 
action point is larger than 10 mm, the resulting contribution to the diphoton mass resolution 
becomes comparable to the contribution from the ECAL energy resolution. It is, therefore, de- 
sirable to use additional information to assign the correct interaction vertex for the photon with 
high probability. This can be achieved by using the kinematic properties of the tracks associ- 
ated with the vertices and exploiting their correlation with the diphoton kinematic properties, 
including the transverse momentum of the diphoton (pj^). In addition, if either of the photons 
converts into an e+e^ pair and the tracks from the conversion are reconstructed and identified, 
the direction of the converted photon, determined by combining the conversion vertex posi- 
tion and the position of the ECAL supercluster, can be extrapolated to identify the diphoton 
interaction vertex. 

For each reconstructed interaction vertex the following set of variables are calculated: the sum 
of the squared transverse momenta of all tracks associated with the vertex and two variables 
that quantify the pj balance with respect to the diphoton system. In the case of a reconstructed 
photon conversion, an additional "pull" variable is used, defined as the distance between the 
vertex z position and the beam-line extrapolated z position coming from the conversion recon- 
struction, divided by the uncertainty in this extrapolated z position. These variables are used 
as input to a BDT algorithm trained on simulated Higgs signal events and the interaction point 
ranking highest in the constructed classifier is chosen as the origin of the photons. 



5.3 Photon selection 
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Figure 4: Comparison of the vertex-identification efficiency between data (circles) and MC sim- 
ulated 7.^ }i}i events (squares), as a function of the Z boson pj. 



The vertex-finding efficiency, defined as the efficiency to locate the vertex to within 10 mm of its 
true position, is studied using Z — > }i}i events where the muon tracks were removed from the 
tracks considered, and the muon momenta were replaced by the photon momenta. The result 
is shown in Fig. |4] The overall efficiency in signal events with a Higgs boson mass of 120 GeV, 
integrated over its pj spectrum, is (83.0 ± 0.4)% in the 7TeV data set, and (79.0 ± 0.2)% in the 
8 TeV data set. The statistical uncertainties in these numbers are propagated to the uncertainties 
in the final result. 

A second vertex related multivariate discriminant is employed to estimate, event-by-event, the 
probability for the vertex assignment to be within 10 mm of the diphoton interaction point. 
This BDT is trained using simulated H — ?► 77 events. The input variables are the classifier 
values of the vertex BDT described above for the three vertices with the highest score BDT 
values, the number of vertices, the diphoton transverse momentum, the distances between the 
chosen vertex and the second and third choices, and the number of photons with an associated 
conversion track. These variables allow for a reliable quantification of the probability that the 
selected vertex is close to the diphoton interaction point. 

The resulting vertex-assignment probability from simulated events is used when constructing 



the Higgs boson signal models. The signal modelling is described in Section 5.5 
5.3 Photon selection 

The event selection requires two photon candidates with transverse momenta satisfying p j (1 ) > 
and Pj(2) > m-y^/ 4, where m^^ is the diphoton invariant mass, within the ECAL fidu- 
cial region \r]\ < 2.5, and excluding the barrel-endcap transition region 1.44 < < 1.57. 
The fiducial region requirement is applied to the supercluster position in the ECAL and the px 
threshold is applied after the vertex assignment. The requirements on the mass-scaled trans- 
verse momenta are mainly motivated by the fact that by dividing the transverse momenta 
by the diphoton mass, turn-on effects on the background-shape in the low mass region are 
strongly reduced. In the rare cases where the event contains more than two photons passing 
all the selection requirements, the pair with the highest summed (scalar) pj is chosen. 
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The relevant backgrounds in the H — ?► 77 channel consist of the irreducible background from 
prompt diphoton production, i.e. processes in which both photons originate directly from the 
hard-scattering process, and the reducible backgrounds from 7 + jet and dijet events, where 
the objects misidentified as photons 

correspond to particles in jets that deposit substantial energy in the ECAL, typically photons 
from the decay of isolated 71° or rf mesons. These misidentified objects are referred to as fake or 
nonprompt photons. 

In order to optimize the photon identification to exclude such nonprompt photons, a BDT clas- 
sifier is trained using simulated pp — > 7 + jet event samples, where prompt photons are used 
as the signal and nonprompt photons as the background. The variables used in the training are 
divided into two groups. The first contains information on the detailed electromagnetic shower 
topology, the second has variables describing the photon isolation, i.e. kinematic information 
on the particles in the geometric neighbourhood of the photon. Examples of variables in the 
first group are the energy-weighted shower width of the cluster of ECAL crystals assigned to 
the photon and the ratio of the energy of the most energetic 3x3 crystal cluster to the total 
cluster energy. The isolation variables include the magnitude of the sum of the transverse mo- 
menta of all other reconstructed particles inside a cone of size AR = 0.3 around the candidate 
photon direction. In addition, the geometric position of the ECAL crystal cluster, as well as the 
event energy density p, are used. 

Instead of having a requirement on the trained multivariate classifier value to select photons 
with a high probability of being prompt photons, the classifier value itself is used as input to 
subsequent steps of the analysis. To reduce the number of events, a loose requirement is im- 
posed on the classifier value (> — 0.2) for candidate photons to be considered further. This 
requirement retains more than 99% of signal photons. The efficiency of this requirement, as 
well as the differential shape of the classifier variable for prompt photons, have been studied 
by comparing Z — > ee data to simulated events, given the similar response of the detector to 
photon and electrons. The comparisons between the differential shape in data and MC sim- 
ulation for the 8 TeV analysis are shown in Fig. |5| for electrons in the barrel (left) and endcap 
(right) regions. 

5.4 Event classification 

The strategy of the analysis is to look for a narrow peak over the continuum in the diphoton 
invariant-mass spectrum. To increase the sensitivity of the search, events are categorized ac- 
cording to their expected diphoton mass resolution and signal-to-background ratio. Categories 
with good resolution and a large signal-to-background ratio dominate the sensitivity of the 
search. To accomplish this, an event classifier variable is constructed based on multi-variate 
techniques, that assigns a high classifier value to events with signal-like kinematic character- 
istics and good diphoton mass resolution, as well as prompt-photon-like values for the pho- 
ton identification classifier. However, the classifier should not be sensitive to the value of the 
diphoton invariant mass, in order to avoid biasing the mass distribution that is used to extract 
a possible signal. To achieve this, the input variables to the classifier are made dimension- 
less. Those that have units of energy (transverse momenta and resolutions) are divided by 
the diphoton invariant-mass value. The variables used to train this diphoton event classifier 
are the scaled photon transverse momenta (p / niyy and pj(2)/m^^), the photon pseudo- 
rapidities ('^(1) and rj{2)), the cosine of the angle between the two photons in the transverse 
plane (cos ((/'(I) — <p{2))), the expected relative diphoton invariant-mass resolutions under the 
hypotheses of selecting a correct/incorrect interaction vertex {Um"'^^^ ^™'^°"'^'^'V»z-y^), the proba- 
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Figure 5: Comparison of the photon identification (ID) classifier variable distribution between 
8 TeV data (points) and MC simulated events (histogram), separated into barrel (left) and end- 
cap (right) electrons originating from Z — > ee events. The uncertainties in the distributions 
from simulation are shown by the cross-hatched histogram. 



bility of selecting a correct vertex (pvtx)/ arid the photon identification classifier values for both 
photons. Thec7-™^™'''V 

ntryj is computed using the single photon resolution estimated by 
the dedicated BDT described in Section 3. A vertex is being labeled as correct if the distance 
from the true interaction point is smaller than 10 mm. 

To ensure the classifier assigns a high value to events with good mass resolution, the events are 
weighted by a factor inversely proportional to the mass resolution. 



^ = + . ^ - . (2) 



This factor incorporates the resolutions under both correct- and incorrect-interaction-vertex 
hypotheses, properly weighted by the probabilities of having assigned the vertex correctly. 
The training is performed on simulated background and Higgs boson signal events. 

The uncertainties in the diphoton event classifier output come from potential mismodelling of 
the input variables. The dominant sources are the uncertainties in the shapes of the photon 
identification (ID) classifier and the individual photon energy resolutions, which are used to 
compute the relative diphoton invariant-mass resolutions. 

The first of these amounts to a potential shift in the photon ID classifier value of at most ±0.01 
in the 8 TeV and ±0.025 in the 7 TeV analysis. These values are set looking to the observed differ- 
ences between the photon ID classifier value distributions from data and simulation. This com- 
parison for the 7 TeV analysis is shown in Fig. |6| where the distribution for the leading (highest 
Pt) candidate photons in the ECAL barrel (left) and endcaps (right) are compared between 
data and MC simulation for niyj > 160 GeV, where most photons are prompt ones. In addi- 



tion to the three background components described in Section 5.3 (prompt-prompt, prompt- 
nonprompt, and nonprompt-nonprompt), the additional component composed by Drell-Yan 
events, in which both final-state electrons are misidentified as photons, has been studied and 
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Figure 6: Distribution of the photon ID classifier value for the larger transverse momentum 
photon in the ECAL barrel (left) and endcaps (right) from candidate diphoton data events 
(points) with m-y-y > 160 GeV. The predicted distributions for the various diphoton back- 
grounds as determined from simulation are shown by the histograms. The variations of the 
classifier value due to the systematic uncertainties are shown by the cross-hatched histogram. 



found to be negligible. As discussed previously a variation of the classifier value by ±0.025, 
represented by the cross-hatched histogram, covers the differences. 

For the second important variable, the photon energy resolution estimate (calculated by a BDT, 
as discussed in Section 3), a similar comparison is shown in Fig. [7[ Again, the 7 TeV data dis- 
tributions for candidate photons in the ECAL barrel (left) and endcap (right) are compared to 
MC simulation for m^j > 160 GeV. The systematic uncertainty of ±10% is again shown as the 
cross-hatched histogram. 

The effect of both these uncertainties propagated to the diphoton event classifier distribution 
can be seen in Fig. |8| where the 7 TeV data diphoton classifier variable is compared to the MC 
simulation predictions. The data and MC simulation distributions in both the left and right 
plots of Fig. |8] are the same. In the left plot, the uncertainty band arises from propagating the 
photon ID classifier uncertainty, while in the right plot, it is from propagating the energy reso- 
lution uncertainty. From these plots one can see that the uncertainty in the photon ID classifier 
dominates the overall uncertainty, and by itself almost covers the full difference between the 
data and MC simulation distributions. Both uncertainties are propagated into the final result. 

The diphoton event classifier output is then used to divide events into different classes, prior 
to fitting the diphoton invariant-mass spectrum. The procedure successively splits events into 
classes by introducing a boundary value for the diphoton classifier output. The first boundary 
results in two classes, and then these classes are further split. Each split is introduced using 
the boundary value that gives rise to the best expected exclusion limit. The procedure is termi- 
nated once additional splitting results in a negligible (<1%) gain in sensitivity. Additionally, 
the lowest score class is dropped since it does not contribute significantly to the sensitivity. This 
procedure results in four event classes for both the 7 and 8 TeV data sets. The systematic uncer- 
tainties in the diphoton identification classifier and photon energy resolution discussed above 
can cause events to migrate between classes. In the 8 TeV analysis, these class migrations are 



5.4 Event classification 



17 




Figure 7: Distribution of the photon resolution estimate cr^ / £ for the leading photon in the 
ECAL barrel (left) and endcaps (right) from candidate diphoton data events (points) with 
niyj > 160 GeV. The predicted distributions for the various diphoton backgrounds, as de- 
termined from simulation, are shown by the histograms. The variations of the resolution due 
to the systematic uncertainties are shown by the cross-hatched histogram. 




Figure 8: The effect of the systematic uncertainty assigned to the photon identification classifier 
output (left) and the photon resolution estimate (right) on the diphoton BDT output for back- 
ground MC simulation (100 GeV < niyy < 180 GeV) and for data. The nominal BDT output 
is shown as a stacked histogram and the variation due to the uncertainty is shown as a cross- 
hatched band. These plots show only the systematic uncertainties that are common to both 
signal and background. There are additional significant uncertainties that are not shown here. 
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up to 4.3% and 8.1%, respectively. They are defined as the relative change of expected signal 
yield in each category under the variation of the photon ID BDT classifier and the per-photon 
energy resolution estimate, within their uncertainties as explained above. 

The sensitivity of the analysis is enhanced by using the special kinematics of Higgs bosons 
produced by the VBF process [110 J. Dedicated classes of events are selected using dijet-tagging 
criteria. The 7 TeV data set has one class of dijet-tagged events, while the 8 TeV data set has two. 

In the 7 TeV analysis, dijet-tagged events are required to contain two jets with transverse ener- 
gies exceeding 20 and 30 GeV, respectively. The dijet invariant mass is required to be greater 
than 350 GeV, and the absolute value of the difference of the pseudorapidities of the two jets 
has to be larger than 3.5. In the 8 TeV analysis, dijet-tagged events are required to contain two 
jets and are categorized as "Dijet tight" or "Dijet loose". The jets in Dijet tight events must 
have transverse energies above 30 GeV and a dijet invariant mass greater than 500 GeV. For 
the jets in the Dijet loose events, the leading (sub leading) jet transverse energy must exceed 30 
(20) GeV and the dijet invariant mass be greater than 250 GeV, where leading and subleading 
refer to the jets with the highest and next-to-highest transverse momentum, respectively. The 
pseudorapidity separation between the two jets is also required to be greater than 3.0. Ad- 
ditionally, in both analyses the difference between the average pseudorapidity of the two jets 
and the pseudorapidity of the diphoton system must be less than 2.5 [11111, and the difference 
in azimuthal angle between the diphoton system and the dijet system is required to be greater 
than 2.6 radians. To further reduce the background in the dijet classes, the pj threshold on the 
leading photon is increased to Pj(l) > niyy/l. 

Systematic uncertainties in the efficiency of dijet tagging for signal events arise from the uncer- 
tainty in the MC simulation modelling of the jet energy corrections and resolution, and from 
uncertainties in simulating the number of jets and their kinematic properties. These uncertain- 
ties are estimated by using different underlying-event tunes, PDFs, and renormalization and 
factorization scales as suggested in Refs. ||25l|26l. A total systematic uncertainty of 10% is as- 
signed to the efficiency for VBF signal events to pass the dijet-tag criteria, and an uncertainty 
of 50%, dominated by the uncertainty in the underlying-event tune, to the efficiency for signal 
events produced by gluon-gluon fusion. 

Table|3]shows the predicted number of signal events for a SM Higgs boson with ?«h = 125 GeV, 
as well as the estimated number of background events per GeV of invariant mass at m^j = 
125 GeV, for each of the eleven event classes in the 7 and 8 TeV data sets. The table also gives 
the fraction of each Higgs boson production process in each class (as predicted by MC simula- 
tion) and the mass resolution, represented both as c^gg, half the width of the narrowest interval 
containing 68.3% of the distribution, and as the full-width-at-half-maximum (FWHM) of the 
invariant-mass distribution divided by 2.35. 

5.5 Signal and background modelling 

The modelling of the Higgs boson signal used in the estimation of the sensitivity has two as- 
pects. First, the normalization, i.e. the expected number of signal events for each of the consid- 
ered Higgs boson production processes; second, the diphoton invariant-mass shape. To model 
both aspects, including their respective uncertainties, the MC simulation events and theoretical 
considerations described in Section|4]are used. To account for the interference between the sig- 
nal and background diphoton final states P112L the expected gluon-gluon fusion process cross 
section is reduced by 2.5% for all values of mn. 

Additional systematic uncertainties in the normalization of each event class arise from potential 



5.5 Signal and background modelling 



19 



Table 3: Expected number of SM Higgs boson events (wjh = 125 GeV) and estimated back- 
ground (at niyry = 125 GeV) for the event classes in the 7 (5.1 fb^^) and 8 TeV (5.3 fb^^) data sets. 
The composition of the SM Higgs boson signal in terms of the production processes and its 
mass resolution are also given. 



Event classes 


SM Higgs boson expected signal (wh = 


125 GeV) 


Background 


Events 


ggH 


VBF 


VH 


ttH 


C^eff 
(GeV) 


FWHM/2.35 
(GeV) 


m^j = 125 GeV 
(events / GeV) 




BDTO 


3.2 


61% 


17% 


19% 


3% 


1.21 


1.14 


3.3 ±0.4 




BDTl 


16.3 


88% 


6% 


6% 




1.26 


1.08 


37.5 ±1.3 


7 TeV 


BDT2 


21.5 


92% 


4% 


4% 




1.59 


1.32 


74.8 ±1.9 




BDT3 


32.8 


92% 


4% 


4% 




2.47 


2.07 


193.6 ±3.0 




Dijet tag 


2.9 


27% 


72% 


1% 




1.73 


1.37 


1.7 ±0.2 




BDTO 


6.1 


68% 


12% 


16% 


4% 


1.38 


1.23 


7.4 ± 0.6 




BDTl 


21.0 


87% 


6% 


6% 


1% 


1.53 


1.31 


54.7 ±1.5 


8 TeV 


BDT2 


30.2 


92% 


4% 


4% 




1.94 


1.55 


115.2 ±2.3 


BDT3 


40.0 


92% 


4% 


4% 




2.86 


2.35 


256.5 ± 3.4 




Dijet tight 


2.6 


23% 


77% 






2.06 


1.57 


1.3 ±0.2 




Dijet loose 


3.0 


53% 


45% 


2% 




1.95 


1.48 


3.7 ±0.4 



class-to-class migration of signal events caused by uncertainties in the diphoton event classifier 
value. The instrumental uncertainties in the classifier value and their effect have been discussed 
previously. The theoretical ones, arising from the uncertainty in the theoretical predictions 
for the photon kinematics, are estimated by measuring the amount of class migration under 
variation of the renormalization and factorization scales within the range mn/l < }i < Itnyi, 
(class migrations up to 12.5%) and the PDFs (class migrations up to 1.3%). These uncertainties 
are propagated to the final statistical analysis. 

To model the diphoton invariant-mass spectrum properly, it is essential that the simulated 
diphoton mass and scale are accurately predicted. This is done by comparing the dielectron 
invariant-mass distribution in Z — > ee events between data and MC simulation, where the 
electrons have been reconstructed as photons. This comparison is shown for the 8 TeV data 
in Fig. |9| where the points represent data, and the histogram MC simulation. Before correc- 
tion, the dielectron invariant-mass distribution from simulation is narrower than the one from 
data, caused by an inadequate modelling of the photon energy resolution in the simulation. 
To correct this effect, the photon energies in the Higgs boson signal MC simulation events are 
smeared and the data events scaled, so that the dielectron invariant-mass scale and resolution 
as measured in Z — ?► ee events agree between data and MC simulation. These scaling and 
smearing factors are determined in a total of eight photon categories, i.e. separately for pho- 
tons in four pseudorapidity regions (|//| < 1, 1 < \rj\ < 1.5, 1.5 < \rj\ < 2, and \r]\ > 2), and 
separately for high R9 (>0.94) and low R9 (<0.94) photons, where R9 is the ratio of the energy 
of the most energetic 3x3 crystal cluster and the total cluster energy. 

Additionally, the factors are computed separately for different running periods in order to ac- 
count for changes in the running conditions, for example the change in the average beam in- 
tensity. These modifications reconcile the discrepancy between data and simulation, as seen in 
the comparison of the dots and solid curve of Fig. |9] The uncertainties in the scaling and smear- 
ing factors, which range from 0.2% to 0.9% depending on the photon properties, are taken as 
systematic uncertainties in the signal evaluation and mass measurement. 
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Figure 9: Comparison of the dielectron invariant-mass spectrum from Z — )■ ee events between 
8 TeV data (points) and the simulated events (histogram), where the selected electrons are re- 
constructed as photons. The simulated distribution after applying smearing and scaling cor- 
rections of the electron energies is shown by the solid line. 



The final signal model is then constructed separately for each event class and each of the four 
production processes as the weighted sum of two submodels that assume either the correct 
or incorrect primary vertex selection (as described in Section [5!2] |. The two submodels are 
weighted by the corresponding probability of picking the right (pvtx) or wrong (1 — pvtx) vertex. 
The uncertainty in the parameter pvtx is taken as a systematic uncertainty. 

To describe the signal invariant-mass shape in each submodel, two different approaches are 
used. In the first, referred to as the parametric model, the MC simulated diphoton invariant- 
mass distribution is fitted to a sum of Gaussian distributions. The number of Gaussian func- 
tions ranges from one to three depending on the event class, and whether the model is a correct- 
or incorrect-vertex hypothesis. The systematic uncertainties in the signal shape are estimated 
from the variations in the parameters of the Gaussian functions. In the second approach, re- 
ferred to as the binned model, the signal mass shape for each event class is taken directly 
from the binned histogram of the corresponding simulated Higgs boson events. The system- 
atic uncertainties are included by parametrizing the change in each bin of the histogram as a 
linear function under variation of the corresponding nuisance parameter, i.e. the variable that 
parametrizes this uncertainty in the statistical interpretation of the data. The two approaches 
yield consistent final results and serve as an additional verification of the signal modelling. The 
presented results are derived using the parametric-model approach. 

The parametric signal models for a Higgs boson mass of 120 GeV in two of the 8 TeV BDT event 



classes are shown in Fig. 10 The signal models are summed over the four production processes. 



each weighted by their respective expected yield as computed from MC simulation. The two 



plots in Fig. 10 illustrate how the diphoton invariant-mass resolution improves with increasing 
diphoton classifier value. The left distribution is for classifier values greater than 0.88 and has 
a mass resolution tJgff = 1.34 GeV, while the right distribution is for classifier values between 
—0.05 and 0.50 and has c^eff = 2.77 GeV. This is the intended behaviour of the event class 
implementation. 



The uncertainties in the weighting factors for each of the production processes arise from varia- 
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Figure 10: Comparison of the diphoton invariant-mass distribution from the parametric signal 
model (blue line) and simulated MC events (open squares) for a Higgs mass hypothesis of 
niu = 120 GeV for two (BDT on the left, BDT 3 on the right) of the four 8TeV BDT event 
classes. 



tions in the renormalization and factorization scales, and uncertainties in the PDFs. They range 
from several percent for associated production with W/ Z to almost 20% for the gluon-gluon 
fusion process. The detailed values for the 8 TeV analysis, together with all the other systematic 
uncertainties discussed above, are summarized in Table |4] The corresponding uncertainties in 
the 7 TeV analysis are very similar, with the exception of the already mentioned uncertainty on 
the photon ID classifier, which was significantly larger in the 7 TeV analysis. The reason for this 
is a worse agreement between data and MC simulation. 

In addition to the per-photon energy scale uncertainties, that are derived in the eight rj — R9 cat- 
egories, additional fully correlated energy scale uncertainties are assigned in order to account 
for possible non-linearity as a function of energy and for additional electron-photon differ- 
ences. The uncertainty associated with possible non-linearities in the energy measurement as 
a function of the cluster energy are evaluated by measuring the energy scale of Z — > ee events 
as a function of the scalar sum of transverse momentum of the two electrons. The change in 
energy scale due to possible non-linearities in the energy measurement is estimated around 
0.2%; since this correction is not applied, a systematic uncertainty of 0.4% is assigned. An addi- 
tional fully correlated uncertainty related to difference of 0.25% between electron and photon 
is assigned, amounting to half of the absolute energy scale difference between electrons and 
photons for non-showering electrons /photons in the barrel. Adding these two numbers in 
quadrature results in the additional energy scale uncertainty of 0.47%, that is treated as fully 
correlated among all event classes. 

The modelling of the background relies entirely on the data. The observed diphoton invariant- 
mass distributions for the eleven event classes (five in the 7 and eight in the 8 TeV analysis) are 
fitted separately over the range 100 < niyy < 180 GeV. This has the advantage that there are 
no systematic uncertainties due to potential mismodelling of the background processes by the 
MC simulation. The procedure is to fit the diphoton invariant-mass distribution to the sum 
of a signal mass peak and a background distribution. Since the exact functional form of the 
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Table 4: Largest sources of systematic uncertainty in the analysis of the 8 TeV data set. Eight 

photon categories are defined, depending on their tj and R9, where R9 is the ratio of the energy 
of the most energetic 3x3 crystal cluster and the total cluster energy. The four pseudorapidity 
regions are: \f]\ < 1 (low f]), 1 < < 1.5 (high rj) for the barrel, and 1.5 < < 2 (low rj), 
\tj\>2 (high rj) for the endcaps; the two R9 regions are: high R9 (> 0.94) and low R9 (<0.94). 



Sources of systematic uncertainty 


Uncertainty 


Per photon 


Barrel 


Endcap 


Photon selection efficiency 

Energy resolution (Au/Emc) ^9 > 0.94 (low rj, high //) 

R9 < 0.94 (low r], high rj) 

Energy scale ((Ejata - Emc)/Emc) R9 > 0.94 (low rj, high i]) 

R9 < 0.94 (low T], high t]) 


0.8% 
0.22%, 0.60% 
0.24%, 0.59% 
0.19%, 0.71% 
0.13%, 0.51% 


2.2% 
0.90%, 0.34% 
0.30%, 0.52% 
0.88%, 0.19% 
0.18%, 0.28% 


Energy scale (fully correlated) 


0.47% 


Photon identification classifier 


0.01 


Photon energy resolution BDT 


10% 


Per event 


Integrated luminosity 
Vertex finding efficiency 

Trigger efficiency — One or both photons R9 < 0.94 in endcap 

Other events 


4.4% 
0.2% 
0.4% 
0.1% 


Dijet selection 


Dijet tagging efficiency VBF 

Gluon-gluon fusion 


10% 
50% 


Production cross sections 


Scale 


PDF 


Gluon-gluon fusion 
VBF 

Associated production with W/Z 
Associated production with tt 


+12.5% -8.2% 
+0.5% -0.3% 

1.8% 
+3.6% -9.5% 


+7.9% -7.7% 
+2.7% -2.1% 

4.2% 

8.5% 
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background in each event class is not known, the parametric model has to be flexible enough to 
describe an entire set of potential underlying functions. Using a wrong background model can 
lead to biases in the measured signal strength. Such a bias can, depending on the Higgs boson 
mass and the event class, reach or even exceed the size of the expected signal, and therefore 
dramatically reduce the sensitivity of the analysis to any potential signal. In what follows, a 
procedure for selecting the background function is described that results in a potential bias 
small enough to be neglected. 

If the true underlying background model could be used in the extraction of the signal strength, 
and no signal is present in the fitted data, the median fitted signal strength would be zero in 
the entire mass region of interest. The deviation of the median fitted signal strength from zero 
in background-only pseudo-experiments can thus be used to quantify the potential bias. These 
pseudodata sets are generated from a set of hypothetical truth models, with each model using 
a different analytical function that adequately describes the observed diphoton invariant-mass 
distribution. The set of truth-models contains exponential and power-law functions, as well 
as polynomials (Bernstein polynomials) and Laurent series of different orders. None of these 
functions is required to describe the actual (unknown) underlying background distribution. 
Instead, we argue that they span the phase-space of potential imderlying models in such a way 
that a fit model resulting in a negligible bias against all of them would also result in a negligible 
bias against the (imknown) true imderlying distribution. 

The first step in generating such pseudodata sets consists of constructing a truth model, from 
which the pseudodata set is drawn. This is done by fitting the data in each of the eleven event 
classes separately, and for each of the four general types of background functions, resulting 
in four truth-models for each event class. The order of the backgroimd function required to 
adequately describe the data for each of the models is determined by increasing the order until 
an additional increase does not result in a significant improvement of the fit to the observed 
data. A p(;^-goodness-of-fit is used to quantify the fit quality, and an F-test to determine the 
termination criterion. "Increasing the order" here means adding additional terms of higher 
order in the case of the polynomial and the Laurent series, and adding additional exponential 
or power-law terms with different parameters in the case of the exponential and power-law 
truth models. 

Once the four truth models are determined for a given event class, ~40 000 pseudodata sets are 
generated for each by randomly drawing diphoton mass values from them. The next step is 
then to find a function (in what follows referred to as fit model), that results in a negligible bias 
against all four sets of toy data in the entire mass region of interest, i.e. an analytical function 
that when used to extract the signal strength in all the 40 000 pseudodata sets, gives a mean 
value for the fitted strength consistent with zero. 

The criterion for the bias to be negligible is that it must be five times smaller than the statistical 
imcertainty in the number of fitted events in a mass window corresponding to the FWHM of 
the corresponding signal model. With this procedure, any potential bias from the background 
fit function can be neglected in comparison with the statistical uncertainty from the finite data 
sample. We find that only the polynomial background function produces a sufficiently small 
bias for all four truth models. Therefore, we only use this background function to fit the data. 
The required order of the polynomial function needed to reach the sufficiently small bias is 
determined separately for each of the 11 event classes, and ranges from 3 to 5. 

The entire procedure results in a background model for each of the event classes as a poly- 
nomial function of a given, class-dependent order. The parameters of this polynomial, i.e. the 
coefficients for each term, are left free in the fit, and their variations are therefore the only source 
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of uncertainty from the modelling of the background. 

The simultaneous fit to the signal-plus-background models, derived as explained above, to- 



gether with the m-y-y distributions for the data, are shown for the eleven event classes in Figs. 11 



and 12 for the 7 and 8 TeV data samples, respectively. The uncertainty bands shown in the 
background component of the fit arise from the variation of the background fit parameters, 
and correspond to the uncertainties in the expected background yield. The fit is performed 
on the data from all event class distributions simultaneously, with an overall floating signal 
strength. In these fits, the mass hypothesis is scanned in steps of 0.5 GeV between 110 and 
150 GeV. At the point with the highest significant excess over the background-only hypothesis 
(wjh = 125 GeV), the best fit value is a/asu = 1.56 ± 0.43. 

In order to better visualize any overall excess /significance in the data, each event is weighted 
by a class-dependent factor, and its corresponding diphoton invariant mass is plotted with that 
weight in a single distribution. The weight depends on the event class and is proportional to 
S/ (S + B), where S and B are the number of expected signal and background events in a mass 
window corresponding to 2t7'gff, centered on w-y-y = 125 GeV and calculated from the signal-plus- 
background fit to all data event classes simultaneously. The particular choice of the weights is 
motivated in Ref. II113L The resulting distribution is shown in Fig. [13| where for reference 
the distribution for the unweighted sum of events is shown as an inset. The binning for the 
distributions is chosen to optimize the visual effect of the excess at 125 GeV, which is evident 
in both the weighted and unweighted distributions. It should be emphasized that this figure is 
for visualization purposes only, and no results are extracted from it. 

5.6 Alternative analyses 

In order to verify the results described above, two alternative analyses are performed. The first 
(referred to as the cut-based analysis) refrains from relying on multivariate techniques, except 
for the photon energy corrections described in Section |3] Instead, the photon identification 
is performed by an optimized set of requirements on the discriminating variables explained in 



Section 5.3 Additionally, instead of using a BDT event-classifier variable to separate events into 
classes, the event classes are built using requirements on the photons directly. Four mutually 
exclusive classes are constructed by splitting the events according to whether both candidate 
photons are reconstructed in the ECAL barrel or endcaps, and whether the R9 variable exceeds 
0.94. This categorization is motivated by the fact that photons in the barrel with high R9 values 
are typically measured with better energy resolution than ones in the endcaps with low R9. 
Thus, the classification serves a similar purpose to the one using the BDT event classifier: events 
with good diphoton mass resolution are grouped together into one class. The four event classes 
used in this analysis are then: 



• both photons are in the barrel, with R9 > 0.94, 

• both photons are in the barrel and at least one of them with R9 < 0.94, 

• at least one photon is in the endcap and both photons with R9 > 0.94, 

• at least one photon is in the endcap and at least one of them with R9 < 0.94. 

The second alternative analysis (referred to as the sideband analysis) uses the identical multi- 
variate technique as the baseline analysis, as well as an identical event sample, but relies on dif- 
ferent procedures to model the signal and background contributions. This approach uses data 
in the sidebands of the invariant mass distribution to model the background. Consequently, 
this analysis is much less sensitive to the parametric form used to describe the diphoton mass 
spectrum and allows the explicit inclusion of a systematic uncertainty for the possible bias in 
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Figure 11: The diphoton invariant-mass distributions for the five classes of the 7 TeV data set 
(points) and the results of the signal-plus-background fits for rtiyj = 125 GeV (lines). The back- 
ground fit components are shown by the dotted lines. The light and dark bands represent the 
±1 and ±2 standard deviation uncertainties, respectively, on the background estimate. 
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Figure 12: The diphoton invariant-mass distributions for the six classes of the 8 TeV data set 
(points) and the results of the signal-plus-background fits for nijj = 125 GeV (lines). The back- 
ground fit components are shown by the dotted lines. The light and dark bands represent the 
±1 and ±2 standard deviation uncertainties, respectively, on the background estimate. 
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Figure 13: The diphoton invariant-mass distribution for the 7 and 8 TeV data sets (points), with 
each event weighted by the predicted S / (S + B) ratio of its event class. The solid and dotted 
lines give the results of the signal-plus-background and background-only fit, respectively. The 
light and dark bands represent the ±1 and ±2 standard deviation uncertainties respectively 
on the background estimate. The inset shows the corresponding unweighted invariant-mass 
distribution around m^^ = 125 GeV. 
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the background mass fit. For any given mass hypothesis nin, a signal region is defined to be 
in the range ±2% on either side of mn- A contiguous set of sidebands is defined in the mass 
distribution on either side of the signal region, from which the background is extracted. Each 
sideband is defined to have the equivalent width of ±2% relative to the mass hypothesis that 
corresponds to the centre of the sideband. A total of six sidebands are used in the analysis 
(three on either side of the signal region), with the two sidebands adjacent to the signal region 



omitted in order to avoid signal contamination, as illustrated in Fig. 14 




100 110 120 130 140 150 160 170 180 

m.„ (GeV) 

Figure 14: The six sidebands (dashed lines) around the signal region (solid line) in the sideband 
analysis. 



The result is extracted by counting events in the signal region, in classes that are defined by 
the output distribution of a BDT. This mass-window BDT takes two dimensionless inputs: the 
diphoton BDT output (as described in Section [5!4)|, and the mass, in the form Am/ niw, where 



Am = m^-y — wjh and jwh is the Higgs boson mass hypothesis. The output of the BDT is binned 
to define the event classes. The bin boundaries are optimized to give the maximum expected 
significance in the presence of a Standard Model Higgs boson signal, and the number of bins 
is chosen such that any additional increase in the number of bins results in an improvement in 
the expected significance of less than 0.1%. The same bin boundaries are used for the signal 
region and for the six sidebands. The dijet-tagged events constitute an additional bin (two bins 
for the 8 TeV data set) appended to the bins of the mass-window BDT output value. 

The background model (i.e. the BDT output distribution for background events in the signal 
region) is constructed from the BDT output distributions of the data in each of the six side- 
bands. The only assumptions made concerning the background model shape, both verified 
within the assigned systematic errors, are that the fraction of events in each BDT output bin 
varies linearly as a function of invariant mass (and thus with sideband position), and that there 
is negligible signal contamination in the sidebands. Only the overall normalization of the back- 
ground model (the total number of background events in the signal region) is obtained from 
a parametric fit to the mass spectrum. The signal region is excluded from this fit. The bias in- 
curred by the choice of the functional form used in the fit has been studied in a similar fashion 



to that described in Section 5.5 and is covered with a systematic uncertainty of 1%. 



The mass-window BDT is trained using simulated Higgs boson events with mn = 123 GeV 
and simulated background events, including prompt-prompt, prompt-fake, and fake-fake pro- 
cesses. The training samples are not used in any other part of the analysis, except as input to 
the binning algorithm, thus avoiding any biases from overtraining. 
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The signal region for mass hypothesis mn = 125 GeV is estimated from simulation to contain 
93% of the signal. The number of expected signal events in each bin is determined using MC 
simulation, as in the baseline analysis. Systematic uncertainties in the signal modelling lead to 
event migrations between the BDT bins, that are accounted for as additional nuisance parame- 
ters in the limit-setting procedure. 



Examples of distributions in this analysis are shown in Fig. 15 for the 7 (left) and 8 TeV (right) 
data sets. The different event classes are listed along the x axis. The first seven classes are the 
mass-window BDT classes. They are ordered by increasing expected signal-to-background ra- 
tio. The class labeled as "Dijet" contains the dijet-tagged events. The number of data events, 
displayed as points, is compared to the expected background events determined from the side- 
band population, shown by the histogram. The expected signal yield for a Higgs boson mass 
of jwh = 125 GeV is shown with the dotted line. 
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Figure 15: The number of observed events (points) for each of the mass- window BDT classes 
in the sideband analysis of H — > 77 for the 7 (left) and 8 TeV (right) data sets. The expected 
number of background events in each class, determined from the sidebands of the diphoton 
invariant-mass distribution, is shown by the solid line. The dark and light bands display the 
±1 and ±2 standard deviation uncertainties in the background predictions, respectively. The 
expected number of signal events in each class for a 125 GeV Higgs boson, as determined from 
MC simulation, is shown by the dotted line. 



The statistical interpretation of the results is given in Section 10. 



6 H ^ ZZ 

6.1 Event selection and kinematics 

The search for the decay H ZZ 4£ with I = e,}i\s performed by looking for a narrow four- 
lepton invariant-mass peak in the presence of a small continuum background. The background 
sources include an irreducible four-lepton contribution from direct ZZ (Z7*) production via 
the qq annihilation and gg fusion processes. Reducible contributions arise from Z + bb and 
tt production, where the final state contains two isolated leptons and two b-quark jets that 
produce two nonprompt leptons. Additional background arises from Z+jets and WZ+jets 
events, where jets are misidentified as leptons. Since there are differences in the reducible 
background rates and mass resolutions between the subchannels 4e, 4^, and 2e2}i, they are 
analyzed separately and the results are then combined statistically. 

Compared to the first CMS ZZ — > 4£ analysis reported in Ref. pl4| , this analysis employs im- 
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proved muon reconstruction, lepton identification and isolation, recovery of final-state-radiation 
(FSR) photons, and the use of a kinematic discriminant that exploits the expected decay kine- 
matics of the signal events. New mass and spin-parity results obtained from a H — )■ ZZ — ?► A£ 
analysis using additional integrated luminosity at the centre-of-mass energy of 8 TeV are de- 
scribed in a recent CMS publication II115L and not discussed further here. 

Candidate events are first selected by triggers that require the presence of a pair of electrons or 
muons. An additional trigger requiring an electron and a muon in the event is also used for 
the 8 TeV data. The requirements on the minimum pj of the two leptons are 17 and 8 GeV. The 
trigger efficiency is determined by first adjusting the simulation to reproduce the efficiencies 
obtained on single lepton legs in special tag-and-probe measurements, and then using the sim- 
ulation to combine lepton legs within the acceptance of the analysis. The efficiency for a Higgs 
boson of mass > 120 GeV, is greater than 99% (98%, 95%) in the 4^ {2e2}i, 4e) channel. The 
candidate events are selected using identified and isolated leptons. The electrons are required 
to have transverse momentum p| > 7 GeV and pseudorapidity within the tracker geometri- 
cal acceptance of l?/*^! < 2.5. The corresponding requirements for muons are pj > 5 GeV and 
\t]<-'\ < 2.4. No gain in expected significance for a Higgs boson signal is obtained by lower- 
ing the Pj thresholds for the leptons, since the improvement in signal detection efficiency is 
accompanied by a large increase in the Z+jets background. 

The lepton-identification techniques have been described in Section |3] The multivariate elec- 
tron identification is trained using a Higgs boson MC simulation sample for the H — )■ ZZ signal 
and a sample of W+l-jet events from data for the background. The working point is optimized 
using a Z+l-jet data sample. For each lepton, £ = e, ji, an isolation requirement of R^^ < 0.4 is 
applied to suppress the Z+jet, Z+bb, and tt backgrounds. In addition, the lepton impact param- 
eter significance with respect to the primary vertex, defined as SIP3D = with IP the impact 
parameter in three dimensions and (7ip its uncertainty, is used to further reduce background. 
The criteria of |SIP3d| < 4 suppresses the Z + bb and tt backgrounds with negligible effect on 
the signal efficiency. 

The efficiencies for reconstruction, identification, and isolation of electrons and muons are mea- 
sured in data, using a tag-and-probe technique 111 161 based on an inclusive sample of Z ^ ££ 
events. The measurements are performed in bins of pj and \f]\. Additional samples of dilep- 
tons with pj < 15 GeV from ]/ip decays are used for the efficiency measurements (in the case of 
muons) or for consistency checks (in the case of electrons). Examples of tag-and-probe results 
for the lepton identification efficiencies obtained with data and MC simulation are shown for 



electrons (top) and muons (bottom) in Fig. 16 The efficiencies measured with data are in agree- 
ment with those obtained using MC simulation. The mean differences (at the percent level) are 
used to correct the MC simulation predictions, and the uncertainty in the difference is propa- 
gated as a systematic uncertainty per lepton. The overall lepton selection efficiencies are ob- 
tained as the product of the reconstruction, identification, and isolation efficiencies. The overall 
efficiency for selecting electrons in the ECAL barrel (endcaps) varies from about 71% (65%) for 
7 < p| < 10 GeV to 82% (73%) at p| ~ 10 GeV, and reaches 90% (89%) for p| ~ 20 GeV. The 
efficiency for electrons drops to about 85% in the transition region, 1.44 < |//^ | < 1.57, between 
the ECAL barrel and endcaps. The muons are selected with an efficiency above 98% in the full 
\r]^'\ < 2.4 range for pj > 5 GeV. 

Photons reconstructed with pseudorapidity |?/^ | < 2.4 are possible FSR candidates. The photon 
selection criteria are optimized as a function of the angular distance between the photon and 
the closest lepton in [t], (p) space. In an inner cone AR = 0.07, photons are accepted if pj > 
2 GeV, with no further requirements. In an outer annulus 0.07 < AR < 0.5, where the rate 
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Figure 16: Measurements of the lepton identification efficiency using a tag-and-probe technique 
based on samples of Z and J/t/? dilepton events. The measurements are shown for electrons (top) 
at 7 TeV and muons (bottom) at 8 TeV as a function of p j for the | rj \ regions of the barrel (left) 
and endcaps (right). For muons, the efficiencies at pj < 15 GeV (dashed line on bottom plots) 
is obtained using ]/xp. The results obtained from data (points with error bars) are compared 
to results obtained from MC simulation (histograms), with the shaded region representing the 
combined statistical and systematic uncertainties. 
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of photons from the underlying event and pileup is much larger, a tighter threshold of 4 GeV 
is used, and the photons are also required to be isolated: the sum of the px of all charged 
hadrons, neutral hadrons, and photons in a cone of radius AR = 0.3 centred on the photon 
should not exceed the pj of the photon itself. In contrast to lepton isolation, and in order to 
take into account the fact that the photon might come from a pileup interaction, the photon 
isolation also uses the charged hadrons associated with other primary vertices. The selection 
criteria have been tuned to achieve approximately the same purity in the two angular regions. 
When reconstructing the Z — > candidates, only FSR photons associated with the closest 
lepton, and that make the dilepton-plus-photon invariant mass closer to the nominal Z mass 
than the dilepton invariant mass, are kept. The dilepton-plus-photon invariant mass must also 
be less than 100 GeV. The performance of the FSR selection algorithm is measured using MC 
H — > ZZ simulation samples, and the rate is verified with inclusive Z-boson events in data. 
Photons within the acceptance for the FSR selection are measured with an efficiency of ~50% 
and a mean purity of 80%. The FSR photons are selected in 5% of inclusive Z-boson events in 
the muon channel and 0.5% in the electron channels. In the case of electrons, the FSR photons 
are often implicitly combined into the electron superclusters, resulting in a lower FSR recovery 
efficiency. 

The Z boson candidates are reconstructed from pairs of leptons of the same flavour and op- 
posite charge {£^i^). The lepton pair with an invariant mass closest to the nominal Z mass is 
denoted as Zi with mass niz^ and is retained if it satisfies 40 < niz^ < 120 GeV. The invariant 
mass of the second Z candidate, denoted Z2, must satisfy 12 < < 120 GeV. The minimum 
value of 12 GeV is found from simulation to provide the optimal sensitivity for a Higgs boson 
mass in the range 110 < oth < 160 GeV. If more than one Z2 candidate satisfies all the criteria, 
we choose the candidate reconstructed from the two leptons with the highest scalar sum of 
their pj. Among the four selected leptons forming Zi and Z2, at least one is required to have 
Pj > 20 GeV and another pj > 10 GeV. These pj thresholds ensure that the selected leptons 
are on the high-efficiency plateau for the trigger. To further reject leptons originating from 
weak semileptonic hadron decays or decays of low-mass hadronic resonances, we require that 
all opposite-charge pairs of leptons chosen from among the four selected leptons (irrespective 
of flavour) have an invariant mass greater than 4 GeV. The phase space for the Higgs boson 
search is defined by restricting the four-lepton mass range to nii^e > 100 GeV. The predicted 
lepton Pj distributions from the MC simulation for a Higgs boson with mn = 125 GeV are 



shown in Fig. 17 for the 4e, 4ji, and 2e2^ channels. Also given in Fig. 17 (bottom right) are 
the event selection efficiencies for each of the three lepton channels, as a function of the Higgs 
boson mass. These distributions clearly emphasize the importance of low lepton-px thresholds 



and high lepton efficiencies. The selection efficiencies shown in Fig. 17 are relative to events 
where all four leptons are within the geometrical acceptance and all dilepton invariant masses 
satisfy m^^ > 1 GeV. The combined signal reconstruction and selection efficiency, for a Higgs 
boson with nin = 125 GeV, is 18% for the 4e channel, 40% for the 4f/ channel, and 27% for the 
2e2}i channel. The expected resolution on the per-event mass measurement is on average 2.2% 
for the 4e channel, 1.1% for the Aji channel, and 1.6% for the 2e2}i channel. 

The kinematics of the H — ?► ZZ — ?► Ai process, as well as for any boson decaying to ZZ, has been 
extensively studied in the literature II117H129I . Since the Higgs boson is spinless, the angular 
distribution of its decay products is independent of the production mechanism. In the Higgs 
boson rest frame, for a given invariant mass of the Ai system, the kinematics are fully described 
by five angles, denoted Q, and the invariant masses of the two lepton pairs Zi and Z2. These 
seven variables provide significant discriminating power between signal and background. 

A kinematic discriminant (Xq) is introduced using the full probability density in the dilep- 
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Figure 17: The MC simulation distributions of the lepton transverse momentum pj for each 
of the four leptons, ordered by pj, from the process H — > ZZ — > 4£ for a Higgs boson mass of 
125 GeV in the 4e (top left), 4]i (top right), and 2e2n (bottom left) channels. The distributions are 
shown for events when all four leptons are within the geometrical acceptance of the analysis 
(open histograms), and for events passing the final selection criteria (solid histograms). The 
bottom-right plot displays the event selection efficiencies for H — ?► ZZ — > A£ determined from 
MC simulation, as a function of the Higgs boson mass, for the 4e, and 2e2}i channels. The 
efficiencies are relative to events where all four leptons are within the geometrical acceptance. 
Divergent contributions from Z7* with 7* — )■ £i at generator level are avoided by requiring 
that all dilepton invariant masses are greater than 1 GeV. 
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ton masses and angular variables, P(?«Zi,mz2, n|m4^). The Kd is constructed for each can- 
didate event based on the probability ratio of the signal and background hypotheses, Ko = 
Vsig/ {'Psig + ^bkg)/ as described in Refs. Il23l HSOJ. For the signal, the phase-space and Z- 
propagator terms II119I are included in a fully analytic parametrization of the Higgs boson 
signal II126I . An analytic parametrization is also used for the background probability distribu- 
tion for the mass range above the ZZ threshold, while it is tabulated using a MC simulation of 
the qq — > ZZ(Z7*) process below this threshold. 

6.2 Background estimation and systematic uncertainties 

The small number of observed candidate events precludes a precise direct determination of 
the background by extrapolating from the signal region mass sidebands. Instead, we rely on 
MC simulation to evaluate the local density (AN/ Ani/^i) of ZZ background events expected as 
a function of m^£. The cross section for ZZ production at NLO is calculated with MCFM 111311 - 
11331 . This includes the dominant process from qq annihilation, as well as from gluon-gluon 
fusion. The uncertainties in the predicted number of background events owing to the variation 
in the QCD renormalization and factorization scales and PDF set are on average 8% for each 
final state [26 1. The number of predicted ZZ — > A£ events and their systematic uncertainties 
after the signal selection are given in Table |5j 

The reducible Z + bb, tt, Z + jets, Z + 7 + jets, and WZ + jets backgrounds contain at least 
one nonprompt lepton in the four-lepton final state. The main sources of nonprompt leptons 
are electrons and muons coming from decays of heavy-flavour quark, misidentified jets (usu- 
ally originating from light-flavour quarks), and electrons from photon conversions. The lepton 
misidentification probabilities are measured in data samples of Z + jet events with one addi- 
tional reconstructed lepton, which are dominated by final states that include a Z boson and a 
fake lepton. The contamination from WZ production in these events is suppressed by requir- 
ing E^^^ < 25 GeV. The lepton misidentification probabilities measured from these events are 
consistent with those derived from MC simulation. These misidentification probabilities are 
applied to dedicated Zi + X control samples, where X contains two reconstructed leptons with 
relaxed isolation and identification criteria. Starting from these samples, two complementary 
approaches are used to extract the corresponding reducible Z + X background yield expected 
in the 4£ signal region. The first approach avoids signal contamination in the background 
sample by reversing the opposite-sign requirement on the Z2 lepton candidates, and then ap- 
plies the fake lepton efficiencies to the additional leptons to calculate the expected number 
of background events in the signal sample. The second approach uses a control region de- 
fined by two opposite-sign leptons failing the isolation and identification criteria, and using 
the misidentification probability to extrapolate to the signal region. In addition, a control re- 
gion with three passing leptons and one failing lepton is also used to estimate the background 
with three prompt leptons and one misidentified lepton. Comparable background predictions 
in the signal region are found from both methods within their uncertainties. The average of 
the two predictions is used for the background estimate, with an uncertainty that includes the 
difference between them (see Table |5]l. 

Systematic uncertainties are evaluated from the data for the trigger (1.5%), and the combined 
four-lepton reconstruction, identification, and isolation efficiencies that vary from 1.2% in the 
4f/ channel at wjh = 150 GeV to about 11% in the 4e channel at = 120 GeV. The effects 
of the systematic uncertainties in the lepton energy-momentum calibration (0.4%) and energy 
resolution on the four-lepton invariant-mass distribution are taken into account. The accuracy 
of the absolute mass scale and resolution is validated using Z — > Y — > ££, and ]/ip — ?► ££ 
events. The effect of the energy resolution uncertainty is taken into account by introducing a 
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20% variation on the simulated width of the signal mass peak. An uncertainty of 50% is as- 
signed to the reducible background rate. This arises from the finite statistical precision in the 
reducible background control regions, differences in the background composition between the 
various control regions, and differences between the data samples used to measure the lep- 
ton misidentification probabilities. Since all the reducible and instrumental background are 
estimated using control regions in the data, they are independent of the uncertainty in the inte- 
grated luminosity However, this uncertainty (2.2% at 7TeV |134| and 4.4% at 8 TeV |,135J ) does 
affect the prediction of the ZZ background and the normalization of the signal in determin- 
ing the Higgs boson cross section. Finally, the systematic uncertainties in the theoretical Higgs 
boson cross section (17-20%) and Ai branching fraction (2%) are taken from Ref. ||25| . 



6.3 Results 

The number of selected ZZ — > A£ candidate events in the mass range 110 < m/^( < 160 GeV for 
each of the three final states is given in Table |5j The number of predicted background events in 
each of the three final states and their uncertainties are also given, together with the number of 
signal events expected from a SM Higgs boson of nin = 125 GeV. 

Table 5: The number of observed selected events, compared to the expected background yields 
and the expected number of signal events {nin = 125 GeV) for each lepton final state in the H — ?► 
ZZ — >• A£ analysis. The estimates of the ZZ background are from MC simulation and the Z + X 
background are based on data. These results are given for the four-lepton invariant-mass range 
from 110 to 160 GeV. The total expected background and the observed numbers of events are 
also given integrated over the three bins ("signal region" defined as 121.5 < m^i < 130.5 GeV) 



of Fig. 18 centred on the bin where the most significant excess is seen. The uncertainties shown 
include both statistical and systematic components. 



Channel 


4e 




2e2^ 


Total 


ZZ background 


2.7 ± 0.3 


5.7 ± 0.6 


7.2 ± 0.8 


15.6 ± 1.4 


Z + X 


1 2+1-^ 






4 4+2.2 


All backgrounds (110 < mu < 160 GeV) 


3 9+^-^ 
'^•'^-0.8 


6.61^- 


9 5+2-U 


20.0;^;^ 


Observed (110 < m^f < 160 GeV) 


6 


6 


9 


21 


Expected Signal (oth = 125 GeV) 


1.37 ± 0.44 


2.75 ± 0.56 


3.44 ± 0.81 


7.6 ± 1.1 


All backgrounds (signal region) 




1 9r;+0.15 
-^•"^'-'-0.13 


1 8C!+U.36 
l.OO_Q28 


a 7q+U.4'/ 


Observed (signal region) 


1 


3 


5 


9 



The observed distribution from data is shown in Fig. 18 There is a clear peak at the Z boson 
mass from the decay Z — > 4£ l|136( . The size and shape of the peak are consistent with those 
from the background prediction. Over the full Higgs boson search region from 110 to 160 GeV, 
the reducible background from Z+X events is much smaller than the irreducible ZZ{Zj* ) back- 
ground. There is an excess of events above the expected background near 125 GeV. The total 
number of observed events and the expected number of background events in the three bins 
centred on the excess (121.5 < < 130.5 GeV), and referred to as the "signal" region, are 
given in Table 5. The expected four-lepton invariant-mass distribution for a Higgs boson with 



a mass of 125 GeV is shown by the open histogram in Fig. 18 



The distributions of the reconstructed Zj and Z2 dilepton invariant masses for the events in the 
signal region are shown in the left and right plots of Fig. [l9| respectively. The Zi distribution 
has a tail towards low invariant mass, indicative that also the highest mass Z is often off-shell. 
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Figure 18: Distribution of the observed four-lepton irwariant mass from the combir\ed 7 and 
8TeV data for the H — > ZZ — > 4£ analysis (points). The prediction for the expected Z+X and 
ZZ(Z7*) background are shown by the dark and light histogram, respectively. The open his- 
togram gives the expected distribution for a Higgs boson of mass 125 GeV. 
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Figure 19: Distributions of the observed Zi (left) and Z2 (right) dilepton invariant masses for 
four-lepton events in the mass range 121.5 < m^^ < 130.5 GeV for the combined 7 and 8TeV 
data (points) . The shaded histograms show the predictions for the background distributions, 
and the open histogram for a Higgs boson with a mass of 125 GeV. 
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The two-dimensional distribution of the kinematic discriminant Kq versus the four-lepton re- 
constructed mass m^i is shown in Fig. |20]for the individual selected events. Superimposed on 
this figure are the contours of the expected event density for the background (upper) and a SM 
Higgs boson at mn = 125 GeV (lower). A clustering of events is observed in the region around 
m^£ = 125 GeV with > 0.7. The background expectation is low in this region and the sig- 
nal expectation is high, corresponding to the excess of events above background seen in the 
one-dimensional distribution. 

The observed distribution of the discriminant values for invariant masses in the signal 



range 121.5 < m4(? < 130.5 GeV is shown in Fig. 21 (left). The m4(? distribution of events sat 



isfying Ku > 0.5 is shown in Fig. 21 (right). The clustering of events is clearly visible near 
m4^Ril25 GeV. 



7 H ^ WW 

The decay mode H — > WW is highly sensitive to a SM Higgs boson with a mass around the 
WW threshold of 160 GeV. With the lepton identification and E^'^® reconstruction optimized 
for LHC pileup conditions, it is possible to extend the sensitivity down to 120 GeV. The search 
strategy for H — > WW is based on the final state in which both W bosons decay leptonically, 
resulting in a signature with two isolated, oppositely charged, high-px leptons (electrons or 
muons) and large £^'^^ caused by the undetected neutrinos. It is not possible to reconstruct the 
Higgs mass in this final state, nevertheless there is some mass sensitivity via different kinematic 
distributions like the dilepton mass or the invariant mass of leptons and £^'^*. The analysis of 
the 7 TeV data is described in Ref . |137l and remains unchanged, while the 8 TeV analysis is 
modified to cope with the more difficult conditions induced by the higher pileup in the 2012 
data taking, and is explained below. 

7.1 WW event selection 

To improve the signal sensitivity, events are separated by jet multiplicity into three mutually 
exclusive categories, which are characterized by different expected signal yields and signal- 
to-background ratios. We call these the 0-jet, 1-jet, and 2-jet categories. Jets are reconstructed 
using the selection described in Section |3j and events are classified according to the number 
of selected jets with Ej > 30 GeV and \f]\ <4.7. To exclude electrons and muons from the 
jet sample, these jets are required to be separated from the selected leptons in AR by at least 
ARjet-iepton > o.3. Events with more than 2 jets are only considered if there are no additional 
jets above this threshold present in the pseudorapidity region between the two highest- Ej jets. 
Furthermore, the search splits candidate signal events into three final states, denoted by: e+e^, 
and e^ji^. 

The bulk of the signal arises through direct WW decays to dileptons of opposite charge, where 
the small contribution proceeding through an intermediate T leptonic decays is implicitly in- 
cluded. The events are selected by triggers that require the presence of one or two high-px 
electrons or muons. The trigger efficiency for signal events that pass the full event selection 
is measured to be above 97% in the ji^fi^ final state, and above 98% in the e+e^ and e^ji^ 
final states for a Higgs boson mass of about 125 GeV. The trigger efficiencies increase along 
with Higgs boson mass. These efficiencies are measured using Z/ 7*— ?■ i^£^ events [,1161 , with 
associated uncertainties of about 1%. 

Two oppositely charged lepton candidates are required, with pj > 20 GeV for the higher-px 
lepton (p^''"^") and px > 10 GeV for the lower-px lepton (Px"^™). Only electrons (muons) with 
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Figure 20: The two-dimensional distribution of the kinematic discriminant Kd versus m^g for 
selected 4£ events in the combined 7 and 8 TeV data. Events in the three different final states are 
designated by symbols shown in the legend. The horizontal error bars indicate the estimated 
per-event mass resolution deduced from the combination of the per-lepton momentum uncer- 
tainties. The contours in the upper plot show the event density for the background expectation, 
and in the lower plot the contours for a SM Higgs boson with niyi = 125 GeV (both in arbitrary 
units). 
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Figure 21: Left: Distribution of the kinematic discriminant Ku for H — > ZZ — > 4£ candidate 
events from the combined 7 and 8 TeV data (vertical lines) in the signal mass region 121.5 < 
m^f < 130.5 GeV. The predicted distributions for the Z+X and ZZ(Z7*) backgrounds and for a 
Higgs boson with a mass of 125 GeV are shown by the histograms. Right: The m4(? distribution 
for data events with Ku > 0.5 (points) and the predicted distributions for the backgrounds and 
a Higgs boson with a mass of 125 GeV (histograms). 



< 2.5 (2.4) are considered in the analysis. 

A tight muon selection is applied, as described in Section|3] Muons are required to be isolated to 
distinguish between muon candidates from W boson decays and those from QCD background 
processes, which are usually in or near jets. For each muon candidate, the scalar sum of the 
transverse energy of all particles consistent with originating from the primary vertex is recon- 
structed in cones of several widths around the muon direction, excluding the contribution from 
the muon itself. This information is combined using a multivariate algorithm that exploits the 
differences in the energy deposition between prompt muons and muons from hadron decays 
inside a jet. 

Electron candidates are identified using the multivariate approach described in Section|3] Elec- 
trons are required to be isolated by applying a threshold on the sum of the transverse energy 
of the particles that are reconstructed in a cone around them, excluding the contribution from 
the electron itself. For both electrons and muons, a correction is applied to account for the 
contribution to the energy in the isolation cone from pileup, as explained in Section |3] 

In addition to high-momentum, isolated leptons and minimal jet activity, missing transverse 
momentum is present in signal events, but generally not in the background. In this analysis, a 
projected E^^'^^ variable is employed. It is equal to the component of the E^'^® vector transverse 
to the nearest lepton direction, if the difference in azimuthal angle between this lepton and the 
£miss vector is less than 90° . If there is no lepton within 90° of the £^'^® direction in azimuth, the 
value of E^^^^ is used. Since the projected EJ^'** resolution is degraded by pileup, the minimum 
of two E^^ observables is used in the determination of the projected E^^ value: the first is 
the standard E^'**, while the second uses only charged particles associated with the primary 
vertex to measure the missing transverse energy. Events with projected E^'®* above 20 GeV are 
selected for the analysis. 
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To suppress the top-quark background, a top-quark tagging technique, based on low-momentum 
muon identification and b-jet tagging [31], is applied. The first selection is designed to veto 
events containing muons fromb hadrons coming from top-quark decays. The second selection 
uses a b-jet tagging algorithm that looks for tracks with large impact parameter within jets. The 
rejection when combining the two selections for the top-quark background is about 50% in the 
0-jet category and above 80% for events with at least one jet passing the selection criteria. 

Various selection criteria are used to reduce the other background contributions. For the W+jets 
background, a minimum dilepton transverse momentum (pj) of 45 GeV is required. To reduce 
the background from WZ production, any event that has a third lepton passing the identi- 
fication and isolation requirements is rejected. This requirement rejects less than 1% of the 
WW — > 2i2v events, while rejecting around 35% of the remaining WZ events. The contribution 
from W7 production, where the photon converts into a electron pair, is reduced by about 90% 
in the dielectron final state by requirements that reject 7 conversions. Those requirements con- 
sist in finding tracks that associated with the electron give good conversion candidates. The 
background from low-mass resonances is rejected by requiring a dilepton mass (m^^) greater 
than 12 GeV. 

The Drell-Yan process produces same-flavour lepton pairs (e+e^ and In order to sup- 

press this background, a few additional requirements are applied in the same-flavour final 
states. First, the resonant Z component of the Drell-Yan production is rejected by requir- 
ing a dilepton mass outside a 30 GeV window centred on the Z mass. Then, the remaining 
off-peak contribution is suppressed by exploiting different £^'®® -based approaches depending 
on the number of jets and the Higgs boson mass hypothesis. At large Higgs boson masses 
(ffjH > 140 GeV), signal events are associated with large £™'®® and, thus, to suppress the Drell- 
Yan background it is sufficient to require the minimum of the two projected E^^^^ variables to 
be greater than 45 GeV. On the contrary, in low-mass Higgs boson events (wh < 140 GeV) it 
is more difficult to separate the signal from the Drell-Yan background; therefore in this case, a 
dedicated multivariate selection, combining the missing transverse momentum with kinematic 
and topological variables, is used to reject Drell-Yan events and maximize the signal yield. A 
third approach is employed in events with two jets. Here, the dominant source of £^'®® is the 
mismeasurement of the hadronic jet energy, and the optimal performance is obtained by re- 
quiring £^'®® > 45 GeV. Finally, the momenta of the dilepton system and the most energetic 
jet must have an angle in the transverse plane smaller than 165°. These selections reduce the 
Drell-Yan background by three orders of magnitude, while rejecting less than 50% of the signal, 
as determined from simulation. 

After applying the full set of selection criteria, referred to as the WW selection, the observed 
yields in the combined 7 and 8TeV data set are 1594, 1186, and 1295 events in the 0-jet, 1-jet, 
and 2-jet categories, respectively. This sample is dominated by nonresonant WW events in 
the 0-jet category and by a similar fraction of WW and top events in the other two categories. 
The main efficiency loss is due to the lepton selection and the stringent £^'^® requirements. 



Figures 22 and 23 show the observed distributions of the azimuthal angle difference (Acpff ) and 
the dilepton mass {mu) after the WW selection, respectively, and the expected distributions 
for a SM Higgs boson with mn = 125 GeV and for backgrounds in the 0- and 1-jet categories. 
The clear difference in the shape between the H — ?► WW and the nonresonant WW processes is 
because of the spin-0 nature of the Higgs boson. 
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CMS \/s = 8TeV, L = 5.1 fb-^ CMS \/s = 8 TeV, L = 5.1 fb"^ 




Figure 22: Distributions of the azimuthal angle difference between selected leptons in 
the 0-jet (left) and 1-jet (right) categories, for data (points), the main backgrounds (solid his- 
tograms), and a SM Higgs boson signal with nin = 125 GeV (hatched histogram) at 8 TeV. The 
standard WW selection is applied. 
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Figure 23: Distributions of the dilepton invariant mass mf£ of selected dileptons in the 0-jet 
(left) and 1-jet (right) categories, for data (points), the main backgrounds (solid histograms), 
and a SM Higgs boson with jmh = 125 GeV (hatched histogram) at 8 TeV. The standard WW 
selection is applied. The last bin contains overflows. 
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7.2 H WW search strategy 



To enhance the sensitivity for a Higgs boson signal, a cut-based approach is chosen for the fi- 
nal event selection. Because the kinematics of signal events change as a function of the Higgs 
boson mass, separate optimizations are performed for different mn hypotheses. The extra re- 
quirements, designed to optimize the sensitivity for a SM Higgs boson, are placed on pj^^^, 
Pj^^, m££, A(p££ and the transverse mass mj, defined as ^2pjE™^^{l — cos A^gmiss^^), where 

Acp^missfj is the difference in azimuthal angle between the E^^^^ direction, and the transverse 
momentum of the dilepton system. The requirements, which are the same for both the 0- and 
1-jet categories, are summarized in Table |6] The niff distribution in the 0-jet (left) and 1-jet 
(right) categories for the e/^ candidate events are shown in Fig. 24 along with the predictions 
for the background and a SM Higgs boson with mn = 125 GeV. 

Table 6: Final event selection requirements for the cut-based analysis of the 0- and 1-jet event 
samples. Values for other Higgs boson mass hypotheses follow a smooth behavior with respect 
to the reported values. 



niH (GeV) 
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niej, (GeV) 


^<pu n 


ntj (GeV) 
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Figure 24: Dilepton invariant mass distribution from the O-jet (left) and 1-jet (right) eji events 
from the 8 TeV data (points with error bars), and the prediction for the various backgrounds 
(solid histograms), and for a SM Higgs boson with mn = 125 GeV (hatched histogram) at 8 TeV. 
The cut-based H — > WW selection, except for the requirement on the dilepton mass itself, is 
applied. 



The 2-jet category is mainly sensitive to VBF production Il74l[75l[77lll38 ), whose cross section is 
roughly ten times smaller than that from gluon-gluon fusion. The VBF channel offers a different 
production mechanism to test the consistency of a signal with the SM Higgs boson hypothesis. 
The VBF signal can be extracted using simple selection criteria, especially in the relatively low- 
background environment of the fully leptonic WW decay mode, providing additional search 
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sensitivity. The H — )■ WW events from VBF production are characterized by two energetic 
forward-backward jets and very little hadronic activity in the rest of the event. Events passing 
the WW criteria are further required to satisfy pj > 30 GeV for the two highest-Ej jets, with 
no jets above this threshold present in the pseudorapidity region between these two jets. Both 
leptons are required to be within the pseudorapidity region between the two jets. To reject the 
main background from top-quark decays, the two jets must have a pseudorapidity difference 
larger than 3.5 and a dijet invariant mass greater than 450 GeV. In addition, nij is required to 
be between 30 GeV and the Higgs boson mass hypothesis. Finally, a mn-dependent upper limit 
on the dilepton mass is applied. 

7.3 Background predictions 

A combination of techniques is used to determine the contributions from the background pro- 
cesses that remain after the final selection. The largest background contributions are estimated 
directly from data, avoiding uncertainties related to the simulation of these sources. The re- 
maining contributions estimated from simulation are small. 

The W+jets and QCD multijet backgroimds arise from semileptonic decays of heavy quarks, 
hadrons misidentified as leptons, and electrons from photon conversions. Estimations of these 
contributions are derived directly from data, using a control sample of events in which one 
lepton passes the standard criteria and the other does not, but instead satisfies a relaxed set 
of requirements ("loose" selection), resulting in a "tight-loose" sample. Then the efficiency, 
^loose/ for s lepton candidate that satisfies the loose selection to also pass the tight selection 
is determined, using data from an independent multijet event sample dominated by non- 
prompt leptons and parametrized as a function of the pj and t] of the lepton. Finally, the 
background contamination is estimated using the events in the "tight-loose" sample, weighted 
by eioose/ (1 ~ ^loose)- The systematic uncertainty in the determination of eioose dominates the 
overall uncertainty of this method, which is estimated to be about 36%. The imcertainty is 
obtained by varying the requirements to obtain eioose/ arid from a closure test, where the tight- 
loose rate derived from QCD simulated events is applied to a W + jets simulated sample to 
predict the rate of events with one real and one misidentified lepton. 

The normalization of the top-quark background is estimated from data by counting the num- 
ber (Ntagged) of top-quark-tagged events and applying a corresponding top-quark-tagging ef- 
ficiency etop- The top-quark-tagging efficiency (etop) is measured with a control sample dom- 
inated by tt and Wt events, which is selected by requiring a b-tagged jet in the event. The 
number of top-quark background events in the signal region is then given by: Njagggd x (1 — 
etop)/£top- Background sources from non-top events are subtracted by estimating the misiden- 
tification probability from data control samples. The main uncertainty comes from the statis- 
tical uncertainty in the b-tagged control sample and from the systematic uncertainties related 
to the measurement of etop- The uncertainty is about 20% in the 0-jet category and about 5% in 
the 1-jet category. 

For the low-mass H — > WW signal region, wh < 200 GeV, the nonresonant WW background 
prediction is estimated from data. This contribution is measured using events with a dilepton 
mass larger than 100 GeV, where the Higgs boson signal contamination is negligible, and the 
MC simulation is then used to extrapolate into the signal region. The total uncertainty is about 
10%, where the statistical uncertainty of the data control region is the largest component. For 
larger Higgs boson masses there is a significant overlap between the nonresonant WW and 
Higgs boson signal, and the simulation is used for the estimation of the background. 

The Z/j*^ contribution to the e+e~ and final states is estimated by extrapolat- 
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ing the observed number of events with a dilepton mass within ±7.5 GeV of the Z mass, with 
the residual background in that region subtracted using e^}i^ events. The extrapolation to the 
signal region is then performed using the simulation. The results are cross-checked with data, 
using the same algorithm and subtracting the background in the Z-mass region, estimated from 
the number of e^^^ events. The largest uncertainty in the estimate is the statistical uncertainty 
in the control sample, which is about 20% to 50%. The Z/ 7* — )■ t+t" contamination is esti- 
mated using Z/7*— > e+e^ and events selected in data, where the leptons are replaced 
with simulated t decays, thus providing a better description of the process Z/ 7* — > t+t^. The 
TAUOLA [54] program is used in the simulation of the t decays to account for r-polarization 
effects. 

Finally, to estimate the W7* background contribution from asymmetric virtual photon decays 
to dileptons [139], where one lepton escapes detection, the MadGraph generator [104] with 
dedicated cuts is used. In particular, all the leptons are required to have a px larger than 5 
GeV and the mass of each lepton is considered in the generation of the samples. To normalize 
the simulated events, a control sample of high-purity W7* events from data with three recon- 
structed leptons is compared to the simulation prediction. A normalization factor of 1.6 ± 0.5 
with respect to the theoretical leading-order W7* cross section is found. 

Other minor backgrounds from WZ, ZZ (when the two selected leptons come from different 
boson decays), and W7 are estimated from simulation. The W7 background estimate is cross- 
checked in data using events passing all the selection requirements, except the two leptons 
must have the same charge; this sample is dominated by W+jets and W7 events. The agreement 
between data and the background prediction in this test is at the 20% level. 

The number of observed events and the expected number of events from all background pro- 
cesses after the WW selection are summarized in Table |7| The number of events observed in 
data and the signal and background predictions after the final selection are listed in Table |8]for 
two Higgs boson mass hypotheses. 



Table 7: Observed number of events and background estimates for the 8 TeV data sample, after 
applying the WW selection requirements. The uncertainties are statistical only. 





WW 


tt+tw 


W+jets 


WZ + ZZ 


Z/7* 


W7(*) 


tot. bkg. 


data 


0- jet 

1- jet 

2- iet 


1046.1 ± 7.2 
381.0 ± 4.0 
177.0 ± 2.8 


164.2 ± 5.4 

527.3 ± 8.4 
886.5 ± 11.1 


158.2 ± 7.1 
122.6 ± 6.7 
94.9 ± 6.4 


32.6 ± 0.6 
30.3 ± 0.6 
20.8 ± 0.5 


73± 17 
77 ±24 
227 ± 20 


27.1 ± 3.9 
23.7 ± 5.2 
5.6 ± 2.1 


1501 ± 21 
1162 ± 27 
1412 ± 24 


1594 
1186 
1295 



7.4 Efficiencies and systematic uncertainties 

The signal efficiency is estimated using simulations. All Higgs boson production mechanisms 
are considered: gluon-gluon fusion, associated production with a W or Z boson (VH), and VBF 
processes. 

Residual discrepancies in the lepton reconstruction and identification efficiencies between data 
and simulation are corrected for by data-to-simulation scale factors measured using Z/7* — > 
events in the Z-peak region II116I , recorded with dedicated unbiased triggers. These factors de- 
pend on the lepton px arid |f/|, and are typically in the range 0.9-1.0. The uncertainties on the 
lepton and trigger efficiencies are about 2% per lepton leg. 

Experimental effects, theoretical predictions, and the choice of MC event generators are con- 
sidered as sources of systematic uncertainty, and their impact on the signal efficiency is as- 
sessed. The experimental uncertainties in lepton efficiency, momentum scale and resolution. 
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Table 8: The signal predictions, background estimates, and numbers of events in data for 
two different Higgs boson mass hypotheses with the 8 TeV data set, after applying the final 
H — > WW cut-based requirements, which depend on the Higgs boson mass hypothesis. The 
different jet categories and dilepton final states are shown separately. The combined statistical, 
experimental, and theoretical systematic uncertainties are given. 





H ^ WW 


WW 


WZ + ZZ + Z/7* 


tt+tw 


W+jets 


W7(*) 


allbkg. 


data 


0-jet category e}i final state 


125 


23.9 ± 5.2 


87.6 ±9.5 


2.2 ± 0.2 


9.3 ±2.7 


19.1 ± 7.2 


6.0 ± 2.3 


124.2 ± 12.4 


158 


130 


35.3 ± 7.6 


96.8 ± 10.5 


2.5 ± 0.3 


10.1 ±2.8 


20.7 ±7.8 


6.3 ± 2.4 


136.3 ±13.6 


169 



0-jet category ee/ }if,i final state 



125 


14.9 ± 3.3 


60.4 ±6.7 


377 ±12.5 


1.9 ±0.5 


10.8 ±4.3 


4.6 ±2.5 


115.5 ±15.0 


123 


130 


23.5 ± 5.1 


67.4 ± 7.5 


41.3 ±15.9 


2.3 ±0.6 


11.0 ±4.3 


4.8 ±2.5 


126.8 ±18.3 


134 



1-jet category eji final state 



125 
130 


10.3 ± 3.0 
15.7 ±4.7 


19.5 ±3.7 
22.0 ± 4.1 


2.4 ± 0.3 
2.6 ± 0.3 


22.3 ± 2.0 
25.1 ± 2.2 


11.7 ±4.6 

12.8 ±5.1 


5.9 ±3.2 
6.0 ±3.2 


61.7 ±70 
68.5 ± 7.6 


54 
64 


1-jet categorj 


I ee/npi final state 


125 
130 


4.4 ±1.3 
71 ±2.2 


9.7 ±1.9 
11.2 ±2.2 


8.7 ±4.9 
9.1 ±5.4 


9.5 ±1.1 
10.7 ±1.2 


3.9 ±1.7 
3.7 ±1.7 


1.3 ±1.2 
1.3 ±1.2 


33.1 ±5.7 
36.0 ± 6.3 


43 
53 


2-jet category eji final state 


125 
130 


1.5 ±0.2 
2.5 ±0.4 


0.4 ±0.1 
0.5 ± 0.2 


0.1 ±0.0 
0.1 ±0.0 


3.4 ±1.9 
3.0 ±1.8 


0.3 ± 0.3 
0.3 ± 0.3 


0.0 ±0.0 
0.0 ±0.0 


4.1 ±1.9 
3.9 ± 1.9 


6 
6 


2-jet categor) 


r ee/n}i final state 


125 
130 


0.8 ±0.1 
1.3 ±0.2 


0.3 ±0.1 
0.4 ± 0.2 


3.1 ±1.8 
3.8 ±2.2 


2.0 ±1.2 
2.0 ±1.2 


0.0 ±0.0 
0.0 ± 0.0 


0.0 ±0.0 
0.0 ±0.0 


5.4 ±2.2 
6.2 ±2.5 


7 
7 



E^i^^ modelling, and jet energy scale are applied to the reconstructed objects in simulated 
events by smearing and scaling the relevant observables, and propagating the effects to the 
kinematic variables used in the analysis. The 36% normalization uncertainty in the W + jets 
background is included by varying the efficiency for misidentified leptons to pass the tight 
lepton selection and by comparing the results of a closure test using simulated samples. 

The relative systematic uncertainty on the signal efficiency from pileup is evaluated to be 1%. 
This corresponds to shifting the mean of the expected distribution of the number of pp collision 
per beam-crossing that is used to reweight the simulation up and down by one pp interaction. 
The systematic uncertainty on the integrated luminosity measurement is 4.4% II135I . 

The systematic uncertainties from theoretical input are separated into two components, which 
are assumed to be independent. The first component is the uncertainty in the fraction of events 
classified into the different jet categories and the effect of migration between categories. The 
second component is the uncertainty in the lepton acceptance and the selection efficiency of 
the other requirements. The effect of variations in the PDF, the value of a,, and the higher- 
order corrections are considered for both components, using the PDF4LHC prescription [95]- 
|99| and the recommendations from Il25l . For the jet categorization, the effects of higher-order 
logarithmic terms via the uncertainty in the parton shower model and the underlying event 
are also considered by comparing different generators. These uncertainties range between 10% 
and 30%, depending on the jet category. The uncertainties related to the diboson cross sections 
are calculated using the MCFM program II131I . 

The systematic uncertainty in the overall signal efficiency is estimated to be about 20% and 
is dominated by the theoretical uncertainty in the missing higher-order corrections and PDF 
uncertainties. The total uncertainty in the background estimations in the H — > WW signal 
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region is about 15%, dominated by the statistical uncertainty in the observed number of events 
in the background-control regions. 

The interpretation of the results in terms of upper limits on the Higgs boson production cross 
section will be given in Section 10. 



8 H ^ TT 

The H — > TT decay mode is sensitive to a SM Higgs boson with a mass below about 145 GeV, 
for which the branching fraction is large. The search uses final states where the two T leptons 
are identified either by their leptonic decay to an electron or muon, or by their hadronic decay 
designated as t;,. Four independent channels are studied: eT;„ jat^, e}i, and In each channel, 
the signal is separated from the background, and in particular from the irreducible Z — > tt 
process, using the T-lepton pair invariant mass wJtt/ reconstructed from the four-momentum 
of the visible decay products of the two t leptons and the EJ^'** vector, as explained in Sec- 
tion 8.2 Events are classified by the number of additional jets in the final state, in order to 
enhance the contribution of different Higgs boson production mechanisms. The 0- and 1-jet 
categories select primarily signal events with a Higgs boson produced by gluon-gluon fusion, 
or in association with a W or Z vector boson that decays hadronically. These two categories are 
further classified according to the pj of the T-lepton decay products, because high-px events 
benefit from a higher signal-to-background ratio. Events in the VBF category are required to 
have two jets separated by a large rapidity, which preferentially selects signal events from the 
vector-boson fusion production mechanism and strongly enhances the signal purity. 

8.1 Trigger and inclusive event selection 

The high-level trigger requires a combination of electron, muon, and Tf, trigger objects ||42lll40[ 
[1411 . The electron and muon HLT reconstruction is seeded by electron and muon level-1 trigger 
objects, respectively, while the T;, trigger object reconstruction is entirely done at HLT stage. A 
specific version of the particle-flow algorithm is used in the HLT to reconstruct these objects 
and quantify their isolation, as done in the offline reconstruction. The identification and iso- 
lation criteria and the transverse momentum thresholds for these objects were progressively 
tightened as the LHC instantaneous luminosity increased over the data taking period. In the 
eXh and fiTf, channels, the trigger requires the presence of a lepton and a t;„ both loosely isolated 
with respect to the offline isolation criteria described below. In the e}i and jiji channels, the lep- 
ton trigger objects are not required to be isolated. For the eT;,, ^mt/j, and fiji channels, the muon 
and electron trigger efficiencies are measured with respect to the offline selection in the data 
and the simulation using Z — > ££{£ = e, fi) events passing a single-lepton trigger. For the eji 
channel, they are determined using Z — > tt — ?> e^ events passing a single-lepton trigger. The 
T;, triggering efficiency is obtained using Z — > tt — > ^T;, events passing a single-muon trigger. 
In the analysis, simulated events are weighted by the ratio between the efficiency measured in 
the data and the simulation, which are parametrized as a function of the lepton or T{j transverse 
momentum and pseudorapidity. 

To be considered in the offline event selection, electrons and muons must fulfill tight isolation 
criteria. The electron and muon isolation parameter Rf^^ is calculated as in Eq. ^ using a cone 
size AR = 0.4, but with the following differences. The sum I^charged Pt is performed considering 
all charged particles associated with the primary vertex, including other electrons and muons. 
The contribution of neutral pileup particles is estimated as 0.5 Licharged,pu PT/ where the sum 
is computed for all charged hadrons from pileup interactions in the isolation cone, and where 
the factor 0.5 corresponds approximately to the ratio of neutral-to-charged hadron energy in 
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the hadronization process, as estimated from simulation. Electrons and muons are required to 
have J^jgp < 0.1. This criterion is relaxed to 0.15 in the e}i channel for leptons in the barrel, and 
in the mi channel for muons with pj < 20GeV. The T-isolation discriminator R^^^ defined in 
Section|3]is used to select loosely isolated t;, so that the overall t;, identification efficiency is 60- 
65%, for a jet misidentification probability of 2-3%. Finally, electrons and muons misidentified 
as T;, are suppressed using dedicated criteria based on the consistency between the tracker, 
calorimeter, and muon-chamber measurements. 

In the eTj, and /^T/, channels, we select events containing either an electron with pj > 20 GeV or 
a muon with pj > 17 GeV, and \r]\ < 2.1, accompanied by an oppositely charged t;, with pj > 
20 GeV and \ri\ < 2.3. In the 8TeV data set analysis, the electron and muon pj thresholds are 
increased to 24 and 20 GeV, respectively, to account for the higher trigger thresholds. In these 
channels, events with more than one loosely identified electron or muon with pj > 15 GeV 
are rejected to reduce the Drell-Yan background. In the efi channel, we demand an electron 
within I?/ 1 < 2.3 and an oppositely charged muon within < 2.1. The higher-pj lepton must 
have Pj > 20 GeV and the other lepton pj > 10 GeV. In the channel, the higher-px muon is 
required to have pj > 20 GeV and the other muon pj > 10 GeV. Both muons must be within 
\f]\ < 2.1. 

Neutrinos produced in the r-lepton decay are nearly collinear with the visible decay products 
because the T-lepton energy is much larger than its mass after event selection. Conversely, 
in W+jets events where a jet is misidentified as T;„ one of the main backgrounds in the It}, 
channels, the high mass of the W results in a neutrino direction approximately opposite to the 
lepton in the transverse plane. In the eT^ and fiTh channels, we therefore require the transverse 
mass 



nij = ^j2pjE^'^^{l - cos(A(/>)) (3) 

to be less than 40 GeV, where pj is the lepton transverse momentum and is the azimuthal 
angle difference between the lepton momentum and the £^'®^ vector. In the eji channel, instead 
of an nij requirement, we demand = — 0.85 ■ p^ > —25 GeV, where 

p^ = pj,i-t + h,2-t + ^t''-L (4) 

pf = PT,l-t+V%2-t- (5) 



Here, as illustrated in Fig. 25 ^ is a unit vector along the ^ axis, defined as the bisector of the 



lepton directions in the transverse plane |142l , pT,i are the lepton transverse momenta, and £^ 
is the missing transverse energy vector. 



The distribution is shown in Fig. 27 h). Requiring a large rejects W+jets and tt events, for 
which the £^'®® vector is typically oriented in the opposite direction of the two-lepton system, 
resulting in a small D^. Conversely, in H — > tt or Z — > tt events, the neutrinos are emitted 
along the directions of the two t leptons, resulting in a large D^. The 0.85 factor is introduced 
to bring the mean of the distribution to for Z — )■ tt. 

In the channel, the sample of dimuon events is largely dominated by the Z ^ mi back- 
ground, which is suppressed using a BDT discriminant combining a set of variables related to 
the kinematics of the dimuon system, and the distance of closest approach between the two 
muons. 
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Caxis 




Figure 25: The ^ axis and the projections onto this axis of £^'^^ and transverse momenta px,! 
and Pt,2 of the two leptons. 

8.2 The TT invariant-mass reconstruction 

The invariant mass Wvis of the visible decay products of the two t leptons can be used as an 
estimator of the mass of a possible parent boson, in order to separate the H — > tt signal from 
the irreducible Z — ?► tt background. However, the neutrinos from T-lepton decays can have 
substantial energy limiting the separation power of this estimator. An alternative approach is 
to reconstruct the neutrino energy using a collinear approximation p43| , which has the disad- 
vantage of providing an unphysical solution for about 20% of the events, in particular when 
the E^** and the parent boson pj are small. The SVFit algorithm described below reconstructs 
the TT invariant-mass nijj with improved resolution and gives a physical solution for every 
event. 

Six parameters are needed to specify T-lepton decays to hadrons: the polar and azimuthal 
angles of the visible decay product system in the T-lepton rest frame, the three boost parameters 
from the T-lepton rest frame to the laboratory frame, and the invariant mass Wvis of the visible 
decay products. In the case of a leptonic T-lepton decay, two neutrinos are produced, and 
the invariant mass of the two-neutrino system constitutes a seventh parameter. The unknown 
parameters are constrained by four observables that are the components of the four-momentum 
of the system formed by the visible T-lepton decay products, measured in the laboratory frame. 
For each hadronic (leptonic) T-lepton decay, 2 (3) parameters are thus left unconstrained. We 
choose these parameters to be: 

• X, the fraction of the T-lepton energy in the laboratory frame carried by the visible 
decay products. 

• (p, the azimuthal angle of the T-lepton direction in the laboratory frame. 

• triyy, the invariant mass of the two-neutrino system. For hadronic T-lepton decay, 

niyi, = 0. 

The two components E^^^^ and E^^^^ of the missing transverse energy vector provide two fur- 
ther constraints, albeit with an experimental resolution of 10-15 GeV on each [144J. 

The fact that the reconstruction of the T-lepton pair decay kinematics is underconstrained by 
the measured observables is addressed by a maximum-likelihood fit method. The mass is 
reconstructed by combining the measured observables E'^^^ and E^'*® with a likelihood model 
that includes terms for the T-lepton decay kinematics and the E^'*^ resolution. The model 
gives the probability density /(z|y, 01,02) to observe the values z = (E^^^, E^'^®) in an event, 
given that the unknown parameters specifying the kinematics of the two T-lepton decays have 
values fli = {xi,(pi,myy^i) and ffi = {x2,(p2,mvv,2), and that the four-momenta of the visible 
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decay products have the measured values y = (Pi^^/P2^^)- The likelihood model is used to 
compute the probability 

P{m\^) = j & [m^TT - mTT(y,fli,fl2)) /(z|y,fli,fl2) dfll dfll, (6) 

as a function of mass hypothesis m\^. The best estimate /Wtt for wJtt is taken to be the value of 
that maximizes P{m\^). 

The probability density /(z|y, 01,02) is the product of three likelihood functions. The first two 
model the decay parameters fli and 02 of the two t leptons, and the last one quantifies the 
consistency of a r-lepton decay hypothesis with the measured E^'**. The likelihood functions 
modelling the T-lepton decay kinematics are different for leptonic and hadronic T-lepton de- 



cays. Matrix elements from Ref. |145| are used to model the differential distributions in the 
leptonic decays, 

within the physically allowed region < ;c < 1 and < niyy < m-j^^/l — x. For hadronic t- 
lepton decays, a model based on two-body phase-space l|146| is used, treating all the T-lepton 
visible decay products as a single system, 

dr 1 

^Th = — 9—/ (8) 

within the physically allowed region < ^ < 1- We have verified that the two-body phase 
space model is adequate for representing hadronic r-lepton decays by comparing distribu- 
tions generated by a parameterized MC simulation based on the two-body phase-space model 
with the detailed simulation implemented in TAUOLA. The likelihood functions for leptonic 
(hadronic) T-lepton decays do not depend on the parameters x and (p {x, cp, and nivy). The de- 
pendence on X enters via the integration boundaries, and the dependence on (p comes from the 
E^^*^ likelihood function. 

The E^'^® likelihood function Lmet quantifies the compatibility of a T-lepton decay hypothesis 
with the reconstructed missing transverse momentum in an event, assuming the neutrinos 
from the T-lepton decays are the only source of E^^, and is defined as 

1 / pmiss mI' \ / pmiss „i' 



In this expression, the expected E^'*® resolution is represented by the covariance matrix V, 
estimated on an event-by-event basis using a E^'^® -significance algorithm |144i , and | V | is the 
determinant of this matrix. 

The 

ni-cT resolution achieved by the SVFit algorithm is estimated to be about 20% from simu- 



lation. Figure 26 shows the normalized distributions of wZvis arid in the jiTh channel from 
simulated Z — ?► tt events and simulated SM Higgs boson events with nij-i = 125 GeV. The 
SVFit mass reconstruction allows for a better separation between signal and background than 
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Figure 26: Normalized distribution of the visible invariant mass wJvis (left) and SVFit mass m-cT 
(right) obtained from MC simulation in the ^T}J channel for the Z — )■ tt background (solid 
histogram) and a SM Higgs boson signal of mass niH = 125 GeV (open histogram). 

8.3 Event categories 

To further enhance the sensitivity of the search for the SM Higgs boson, the selected events 
are split into mutually exclusive categories based on the jet multiplicity and the transverse 
momentum of the visible T-lepton decay products. The jet multiplicity categories are defined 
using jets within \rj\ < 5. In some cases, events are rejected if they contain a b-tagged jet, 
identified using the CSV algorithm described in Section |3j From simulation, the efficiency for 
b-jet tagging is 75%, with a misidentification rate of 1%. The event categories are: 

• VBF: In this category, two jets with pj > 30 GeV are required in the event. A rapidity 
gap is demanded by requiring there be no third jet with pj > 30 GeV between these 
two jets. A BDT discriminator is used to discriminate between VBF Higgs boson 
production and the background processes. This discriminator takes as input the 
invariant mass of the two jets, the differences in t] and (p between the directions of 
the two jets, the pj of the t;,T;j system, the pj of the t;,t;,-£^^*^ system, the pj of the 
dijet system, and the difference in rj between the t;,T;, system direction and the closest 
jet. In the efi channel, the large tt background is suppressed by rejecting events with 
a b-tagged jet with pj > 20 GeV. 

• 1-jet: Events in this category are required to have >1 jet with pj > 30 GeV, not fulfill 
the VBF criteria, and not contain any b-tagged jet with pj > 20 GeV. This category 
addresses the production of a high-px Higgs boson recoiling against a jet. Events 
with high-px Higgs bosons typically have much larger £^'®® and thus benefit from 
a more precise measurement of nirT, owing to the improved E^'*® resolution. In 
the eT;, channel, the large background from Z — ?► ee + jets events with one electron 
misidentified as t/, is reduced by requiring E^^^ > 30 GeV. 

• 0-jet: This category requires events to have no jet with pj > 30 GeV and no b-tagged 
jet with Pj > 20 GeV. In the eT/, channel, E^^^ is required to be larger than 30 GeV 
as in the 1-jet category. 

The 0- and 1-jet categories are each further divided into two subsets, using the pj of the visible 
T-lepton decay products, either hadronic or leptonic. We label these subsets "low-px " and 
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"high-px "• In the ez/, and jiTh channels, the boundary between the two subsets is defined 
as pT{Th) = 40GeV. In the efi and channels, the threshold is at 35GeV on the muon pj 
and 30 GeV on the leading muon pj, respectively. Thus, five independent categories of events 
are used in the SM Higgs boson search: VBF, l-jet/high-pj, 1-jet/low-px/ 0-jet/high-px, and 
0-jet/low-px- 

8.4 Background estimation and systematic uncertainties 

For each channel and each category. Table |9] shows the overall number of events observed in 
the 7 and 8 TeV data, as well as the corresponding number of expected events from the various 
background contributions, in the full m^j range. The expected number of events from a SM 
Higgs boson signal of mass = 125 GeV is also shown. The numbers in Table |9] cannot 
be used to estimate the global significance of a possible signal since the expected significance 
varies considerably with nij^, and the sensitive 1-jet/high-px category is merged with the 1- 
jet/low-px category. 

The largest source of background is the Drell-Yan production of Z — > tt. This contribution 
is greatly reduced by the 1-jet and VBF selection criteria, and is modelled using a data sample 
of Z — > /^/^ events, in which the reconstructed muons are replaced by the reconstructed parti- 
cles from simulated T-lepton decays, a technique called "embedding". The background yield 
is rescaled to the Z — > f//^ yield in the data before any jet selection, thus, for this dominant 
background, the systematic uncertainties in the efficiency of the jet-category selections and the 
luminosity measurement are negligible. In the eT;, and //t;, channels, the largest remaining 
systematic uncertainty affecting this background yield is in the T;, selection efficiency. This un- 
certainty, which includes the uncertainty in the t;, triggering efficiency, is estimated to be 7% 
from an independent study based on a tag-and-probe technique |116|. 

The Drell-Yan production of Z — )■ £i, labelled as Z+jets in Table |9j is an important source of 
background in the ezi, channel, owing to the 2-3% probability for electrons to be misidentified 
as Tj, [48J, and the fact that the reconstructed tt invariant-mass distribution peaks in the Higgs 
boson mass search range. The contribution of this background in the eZh and f/T;, channels is 
estimated from simulation. The simulated Drell-Yan yield is rescaled to the data using Z — ?► 
mi events, and the efficiencies of the jet category selections are measured in a Z ^ data 
sample. The dominant systematic uncertainty in the background yield is from the i — ?► t;, 
misidentification rate, which is obtained by comparing tag-and-probe measurements from Z — > 
££ events in the data and the simulation, and is 30% for electrons and 100% for muons. The 
very small probability for a muon to be misidentified as T/, makes it difficult to estimate the 
systematic uncertainty in this probability, but also makes this background very small in the j^iT}, 
channel. 

The background from W+jets production contributes significantly to the eT;, and fiT}, channels 
when the W boson decays leptonically and one jet is misidentified as a T;,. The background 
is modelled for these channels using the simulation. The W+jets background yield is normal- 
ized to the data in a high-mx control region dominated by the background in each of the five 
categories. The factor for extrapolating to the low-mx signal region is obtained from the sim- 
ulation, and has a 30% systematic uncertainty. In the 1-jet/high-px and VBF categories, where 
the number of simulated events is marginal, mass-shape templates are obtained by relaxing 
the T;, isolation requirement, ensuring that the bias introduced in the shape is negligible. Fig- 
ure 27 (upper left) shows the nij distribution obtained in the fiTjj channel after the inclusive 
selection from data and simulation. In the high-mx region, the agreement between the ob- 
served and expected yields comes from the normalization of the W+jets prediction to the data. 
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Table 9: Observed and expected numbers of events in the four H — > tt decay channels and the 
3 event categories, for the combined 7 and 8 TeV data. The uncertainties include the statistical 
and systematic uncertainties added in quadrature. In the 0- and 1-jet categories, the low- and 
high-px subcategories have been combined. The expected number of signal events for a SM 
Higgs boson of mass mh = 125 GeV is also given. 



Process 


0-jet 


1-jet 


VBF 




Z-> TT 


2550 ± 200 


2130 ± 170 


53 ±6 


QCD 


910 ± 70 


410 ± 30 


35 ±8 


W+jets 


1210 ± 70 


1111 ±75 


46± 10 


Z+jets 


560 ± 99 


194 ±24 


13±2 


tt 


162 ± 21 


108 ± 13 


7±2 


Dibosons 


20 ±5 


60 ±14 


1.1 ±0.9 


Total Background 


5410 ± 270 


4020 ± 220 


155 ± 15 


H-)- TT (125GeV) 


15±2 


26 ±4 


4.4 ± 0.7 


Data 


5273 


3972 


142 




Z ^ TT 


50 500 ±3800 


10 570 ±830 


100 ±11 


QCD 


14 100 ±1600 


3980 ±510 


41 ±9 


W+jets 


13 300 ±1300 


5600 ±480 


72 ±15 


Z+jets 


1620 ± 230 


658 ± 97 


2.5 ± 0.6 


tt 


651 ± 82 


479 ± 61 


15 ±3 


Dibosons 


298 ± 70 


256 ±58 


3±2 


Total Backgroimd 


80 400 ± 4500 


21 500 ± 1200 


234 ±22 


TT (125 GeV) 


141 ± 21 


86 ±12 


8±1 


Data 


80229 


22009 


263 


eji 


Z — TT 


22 030 ±850 


5030 ±230 


56±5 


QCD 


940 ± 200 


550 ± 120 


7±2 


tt 


39 ±3 


831 ± 86 


24 ±6 


Dibosons 


796 ± 96 


550 ± 120 


11 ±2 


Total Backgroimd 


23 800 ±930 


6960 ±350 


99 ±9 


H-> TT (125 GeV) 


53 ±7 


35 ±4 


3.5 ± 0.5 


Data 


23274 


6847 


110 




Z TT 


9120 ±490 


1980 ± 120 


5.3 ± 0.4 


QCD 


759 ± 53 


341 ± 27 


<1 


W+jets 


145 ± 10 


19 ±1 


<1 


Z— >• jiji 


(1263 ± 73) X 10^ 


(380 ± 24) X 10^ 


71 ±10 


tt 


2440 ±200 


1330 ± 130 


7±2 


Dibosons 


1500 ± 1100 


2210 ± 790 


2.4 ± 0.9 


Total Background 


(1277 ± 73) X 10^ 


(386 ± 24) X 10^ 


85± 11 


H ^ TT (125 GeV) 


26 ±4 


16 ±2 


0.8 ±0.1 


Data 


1291 874 


385494 


83 
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Figure 27: The observed distributions (points with error bars) for the (upper left) transverse 
mass nij in the fiTh channel at ^/s = 7 TeV; (upper right) — 0.85 • p^^, (lower left) number 
of jets, and (lower right) number of b-tagged jets in the eji channel at a/s = 8 TeV. The ex- 
pected distributions from the various background sources are shown by the shaded histograms. 
In particular, the Electroweak background combines the expected contributions from W+jets, 
Z+jets, and diboson processes. The predictions for a SM Higgs boson with nin = 125 GeV are 
given by the dotted histograms, multiplied by a factor of 5 for clarity. 

The agreement in shape indicates good modelling of E^^^^ in the simulation. 

The tt production process is the main remaining background in the efi channel. The predicted 
yield for all channels is obtained from simulation, with the yield rescaled to the one observed in 
the data from a tt-enriched control sample, extracted by requiring b-tagged jets. The systematic 
uncertainty in the yield includes a 10% systematic uncertainty in the b-tagging efficiency. Fig- 
ures |27| (upper right), (lower left), and (lower right) show the distributions of D^, the number 
of jets, and the number of b-tagged jets in the eji channel. There is good agreement between 
the data and the background predictions in the distributions at low values in Fig. [27| (upper 



right), and at high numbers of jets in Fig. 27 (lower left) and (lower right), where the tt process 
dominates. 

QCD multijet events, in which one jet is misidentified as Zi, and another as a lepton, consti- 
tute another important source of background in the ez;, and f/T;, channels. In the 0- and 1-jet 
categories, the QCD multijet background prediction is obtained using a control sample where 
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the lepton and the Th are required to have the same charge. In this control sample, the QCD 
multijet distribution and yield are obtained by subtracting from the data the contribution of 
the Drell-Yan, tt, and W+jets processes, estimated as explained above. The expected contri- 
bution of the QCD multijet background in the opposite-charge signal sample is then derived 
by rescaling the yield obtained in the same-charge control sample by a factor of 1.1, which is 
measured in the data using a pure QCD multijet sample obtained by inverting the lepton isola- 
tion and relaxing the T/, isolation. The 10% systematic uncertainty in this factor covers its small 
dependence on Pt{t}i) and the statistical uncertainty in its measurement, and dominates the 
uncertainty in this background contribution. In the VBF category, the number of events in the 
same-charge control sample is too small to use this procedure. Instead, the QCD multijet yield 
is obtained by multiplying the inclusive QCD yield by the VBF selection efficiency measured 
in data using a QCD-dominated sample in which the lepton and the Th are not isolated. The 
mass shape template is obtained from data by relaxing the muon and isolation criteria. 

The small background from W+jets and QCD multijet events in the e^ channel is estimated 
from the number of events with one identified lepton and a second lepton that passes relaxed 
selection criteria, but fails the nominal lepton selection. This number is converted to the ex- 
pected background yield using the efficiencies for such loosely identified lepton candidates to 
pass the nominal lepton selection. These efficiencies are measured in data using QCD multijet 
events. 

Finally, the small background contribution in each channel from diboson and single top-quark 
production is estimated using the simulation. The main experimental systematic uncertain- 
ties affecting the expected signal yield are from the Zh identification efficiency (7%), the E^^^ 
scale (5%), owing to the nij requirement and the £^^^^ selection applied to the 0- and 1-jet cate- 
gories of the eT}j channel, the integrated luminosity (5%), and the jet energy scale (< 4%). The 
uncertainties in the muon and electron selection efficiencies, including trigger, identification, 
and isolation, are both 2%. The theoretical uncertainty in the signal yield comes from the un- 
certainties in the PDFs, the renormalization and factorization scales, and the modelling of the 
underlying event and parton showers. The magnitude of the theoretical uncertainty depends 
on the production process (gluon-gluon fusion, VBF, or associated production) and on the event 
category. In particular, the scale uncertainty in the VBF production yield is 10%. The scale un- 
certainty in the gluon-gluon fusion production yield is 10% in the l-jet/high-pj category and 
30% in the VBF category. The T;, (3%) and electron (1%) energy scale uncertainties cause an un- 
certainty in the m-^r spectrum shape, and are discussed in the next section. The muon energy 
scale uncertainty is negligible. 

8.5 Results 



The statistical methodology described in Section 10.1 is used to search for the presence of a 
SM Higgs boson signal, combining the five categories of the four final states in the 7 and 8 TeV 
data sets as forty independent channels in a binned likelihood based on the wJtt distributions 
obtained for each channel. Systematic uncertainties are represented by nuisance parameters 
in the likelihood. A log-normal prior is assumed for the systematic uncertainties affecting 
the background normalization, discussed in the previous section. The Tjj and electron energy 
scale uncertainties, which affect the shape of the m-i-T spectrum, are represented by nuisance 
parameters whose variation results in a continuous change of this shape [147J. 



Figures 28 and 29 show the observed m-cr distributions in the eT;„ }iTh, epi, and ptfi channels, 
for each event category, compared with the background predictions. The 7 and 8 TeV data sets 
are merged, as well as the low- and high-px subcategories of the 0- and 1-jet categories. The 
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binning given in the figures corresponds to the binning used in the Hkelihood. The background 
mass distributions are the result of the global maximum-likelihood fit under the background- 
only hypothesis. This fit finds the best set of values for the nuisance parameters to match 
the data, assuming no signal is present. The variation of the nuisance parameters is limited 
by the systematic uncertainties estimated for each of the background contributions and used 
as input to the fit. For example, in the VBF category of the ez/, channel, the most important 
nuisance parameters related to background normalization are the ones affecting the Z — )■ rr 
yield (t;, selection efficiency), the Z ^ ee yield (e T/, misidentification rate), the W+jets 
yield (extrapolation from the high nij to the low mj region), and the QCD yield (ratio between 
the yields in the opposite-charge and same-charge regions). The fit makes use of the high-mT-T 
region of the VBF category to constrain the nuisance parameters affecting the W+jets yield. The 
nuisance parameter related to the t;, identification efficiency is mostly constrained by the 0- and 
1-jet categories, where the number of events in the Z — )■ tt peak is much larger. It is also the 
case for the nuisance parameter related to the T}, energy scale, which affects the shape of the 
Z — > TT distribution. 

The interpretation of the results in terms of upper limits on the Higgs boson production cross 
section is given in Section 10. 



9 H^bb 

The decay H — )■ bb has the largest branching fraction of the five search modes for mn < 
135 GeV, but the signal is overwhelmed by the QCD multijet production of b quarks. The 
analysis is therefore designed to search for a dijet resonance in events where a Higgs boson 
is produced at high pj, in association with a W or Z boson that decays leptonically, which 
largely suppresses the QCD multijet background. The following final states are included in the 
search: W(/^i/)H, W(ei/)H, Z{mi)H, Z(ee)H, and Z{vv)H, all with the Higgs boson decaying to 
bb. Backgrounds arise from the production of vector bosons in association with jets (from all 
quark flavours), singly- and pair-produced top quarks, dibosons, and QCD multijet processes. 
Simulated samples of signal and background events are used to optimize the analysis. Control 
regions in data are selected to adjust the predicted event yields from simulation for the main 
background processes and to estimate their contribution in the signal region. 

Several different high-level triggers are used to collect events consistent with the signal hy- 
pothesis in all five channels. For the WH channels, the trigger paths consist of several single- 
lepton triggers with tight lepton identification. Leptons are also required to be isolated from 
other tracks and calorimeter energy depositions to maintain an acceptable trigger rate. For the 
W(^v)H channel, in the 7TeV data set, the trigger thresholds for the muon transverse mo- 
mentum, Pj, vary from 17 to 40 GeV. The higher thresholds are implemented for periods 
of higher instantaneous luminosity. For the 8 TeV data set, the muon pj threshold is 24 GeV 
for the isolated-muon trigger, and 40 GeV for muons without any isolation requirements. The 
combined single-muon trigger efficiency is ^90% for signal events that pass all offline require- 



ments, described in Section 9.1 For the W(ei/)H channel, in the 7 TeV data set, the electron pj 
threshold ranges from 17 to 30 GeV. In addition, two jets and a minimum value on the miss- 
ing transverse energy are required. These additional requirements help to maintain acceptable 
trigger rates during the periods of high instantaneous luminosity. For the 8 TeV data set, a 
single-isolated-electron trigger is used with a 27 GeV pj threshold. The combined efficiency for 
these triggers for signal events that pass the final offline selection criteria is larger than 95%. 

The Z(f//^)H channel uses the same single-muon triggers as the W(/^i/)H channel. For the 
Z(ee)H channel, dielectron triggers with lower-px thresholds of 17 and 8 GeV and tight isola- 
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Figure 28: Observed (points with error bars) and expected (histograms) m-rr distributions for 
the eT/j (left) and fiTjj (right) channels, and, from top to bottom, the 0-jet, 1-jet, and VBF cate- 
gories for the combined 7 and 8 TeV data sets. In the 0- and 1-jet categories, the low- and high-px 
subcategories have been summed. The electroweak background combines the expected contri- 
butions from W+jets, Z+jets, and diboson processes. In the case of ex;,, the Z — > ee background 
is shown separately. The dotted histogram shows the expected distribution for a SM Higgs 
boson with nin = 125 GeV (multiplied by a factor of 5 for clarity). 
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Figure 29: Observed (points with error bars) and expected (histograms) m-rr distributions for 
the efi (left) and (right) channels, and, from top to bottom, the 0-jet, 1-jet, and VBF categories 
for the combined 7 and 8 TeV data sets. In the 0- and 1-jet categories, the low- and high-px 
subcategories have been summed. The electroweak background combines the contributions 
from W+jets, Z+jets, and diboson processes. In the case of the Z — > /^^ background is 
shown separately. The dotted histogram shows the expected distribution for a SM Higgs boson 
with Wh = 125 GeV (multiplied by a factor of 5 for clarity). 
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tion requirements are used. These triggers are ~ 99% efficient for ZH signal events that pass 
the final offline selection criteria. For the Z(vi/)H channel, combinations of several triggers are 
used, all with the requirement that the missing transverse energy be above a certain threshold. 
Additional jet requirements are made to keep the trigger rates acceptable as the luminosity in- 
creases and to reduce the £^'®® thresholds, in order to increase the signal acceptance. A trigger 
with E^^^^ > 150 GeV requirement is implemented for both the 7 and 8 TeV data sets. For the 
7 TeV data, triggers that require the presence of two jets with | // 1 < 2.6, px > 20 GeV, and £^^^ 
thresholds of 80 and 100 GeV, depending on the instantaneous luminosity, are also used. For 
the 8 TeV data set, a trigger that requires two jets, each with |//| < 2.6 and pj > 30 GeV, and 
£miss ^ gg QgY jg gigQ implemented. As the instantaneous luminosity increased further, this 
trigger was replaced by one requiring E^^^^ > 100 GeV, two jets with |?/| < 2.6, one with pj 
> 60 GeV and the other with pj > 25 GeV, the dijet pj > 100 GeV, and no jet with pj > 40 GeV 
within 0.5 radians in azimuthal angle of the £^'®® vector. For Z(i/v)H signal events with miss- 
ing transverse energy > 160 GeV, the overall trigger efficiency is ~ 98% with respect to the 
offline event reconstruction and selection described below. The corresponding efficiency for 
120 < Ef^' < 160 GeV is about 66%. 

9.1 Event selection 

The final-state objects used in the H — > bb event reconstruction are described in Section |3] 
Electron candidates are considered in the pseudorapidity range < 2.5, excluding the 1.44 < 
< 1.57 transition region between the ECAL barrel and endcaps. Tight muon candidates are 
considered in the |7/| < 2.4 range. An isolation requirement on R^^ of approximately 10%, as 
calculated in Eq. ([T]), that is consistent with the expectation for leptons originating from W and 
Z boson decays, is applied to electron and muon candidates. The exact requirement depends 
on the lepton rj, pj, and flavour. To identify b jets, different values for the CSV output discrimi- 
nant, which can range between and 1, are used, with corresponding different efficiencies and 
misidentification rates. For example, with a CSV > 0.90 requirement, the efficiencies to tag b 
quarks, c quarks, and light quarks, are 50%, 6%, and 0.15%, respectively [|31J . The correspond- 
ing efficiencies for CSV > 0.50 are 72%, 23%, and 3%. All events from data and simulation 
are required to pass the same trigger and event reconstruction algorithms. Scale factors that 
account for differences in the performance of these algorithms between data and simulation 
are computed and used in the analysis. 

The background processes to VH production are V+jets, tt, single-top-quark, diboson (VV), 
and QCD multijet production. These overwhelm the signal by several orders of magnitude. 
The event selection is based on the kinematic reconstruction of the vector boson and the Higgs 
boson decay into two b-tagged jets. Backgrounds are then substantially reduced by requiring 
a significant boost of the px of the vector boson and the Higgs boson [148], which tend to 
recoil from each other with a large azimuthal opening angle, A(^(V,H), between them. For each 
channel, two ranges of Pt(V) are considered. These are referred to as "low" and "high". Owing 
to different signal and background compositions, each Pt(V) range has a different sensitivity, 
and the analysis is performed separately for each range. The results from all the ranges are 
then combined for each channel. The ranges for the WH channels are 120 < Pt(V) < 170 GeV 
and Pt(V) > 170 GeV, for the Z{vv)H channel 120 < Pt(V) < 160 GeV and Pt(V) > 160 GeV, 
and for the Z{U)H channel 50 < Pt(V) < 100 GeV and Pt(V) > 100 GeV. 

Candidate W — > £v decays are identified by requiring the presence of a single isolated lepton 
and missing transverse energy. Muons (electrons) are required to have a px above 20 (30) GeV. 
For the W(ev)H channel only, to reduce contamination from QCD multijet processes, E^'*® is 
required to be greater than 35 GeV. Candidate Z ^ £i decays are reconstructed by combin- 
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ing isolated, oppositely charged pairs of electrons or muons with pj > 20 GeV and a dilepton 
invariant mass satisfying 75 < nigg < 105 GeV. The identification of Z — )■ i/v decays requires 
the £^'*® in the event to be within the Pt(V) ranges described above. Two requirements sup- 
press events from QCD multijet processes with an £^^^ arising from mismeasured jets. First, 
the £^'®® vector must be isolated from jet activity, using the requirement that the azimuthal 
angle difference A^(E!^^*,j) between the E^^^ direction and any jet with |?/| < 2.5 and pj >20 
(30) GeV be greater that 0.5 radians for the 7 (8) TeV data sample. Second, the azimuthal an- 
gle between the E^^^^ vector calculated using only charged particles with pj > 0.5 GeV and 
\}]\ < 2.5 and the direction of the standard E^'^^ vector (calculated using all particles, charged 
and neutral) must be greater than 0.5 radians. Subject to these two requirements, background 
from QCD multijet processes is reduced to a negligible level in the Z(t/i/)H channel. To re- 
duce the tt and WZ background in the WH and Z(vv)H channels, events where the number of 
additional isolated leptons with pj > 20 GeV is greater than are rejected. 

Reconstruction of the H — > bb decay is done by requiring two jets above the minimum pj 



thresholds listed in Table 10 having \r]\ < 2.5, and tagged by the CSV algorithm. If more than 
two such jets are found in the event, the pair with the highest total dijet transverse momentum, 
pxOj)/ is selected. The background from V+jets and dibosons is reduced significantly through 
b tagging, and subprocesses where the two jets originate from genuine b quarks dominate the 
final selected data sample. After all the event selection criteria are applied, the invariant-mass 
resolution for the Higgs boson decay to bb is approximately 10%, as found in a previous CMS 
analysis 1,149 1. The mass resolution is improved here by applying regression techniques simi- 
lar to those used by the CDF experiment II150| . Through this procedure, a further correction, 
beyond the standard jet energy corrections, is computed for individual b jets in order to bet- 
ter measure the true parton energy. A BDT algorithm is trained on simulated H — > bb signal 
events, with inputs that include detailed information about each jet that helps to differenti- 
ate b-quark jets from light-flavour jets. The resulting improvement in the bb invariant-mass 
resolution is approximately 15%, resulting in an increase in the analysis sensitivity of 10-20%, 
depending on the specific channel. The BDT regression is implemented in the TMVA frame- 
work [40}. The complete set of input variables is (though not all variables are used for every 
channel): 

• transverse momentum of the jet before and after energy corrections; 

• transverse energy and mass of the jet after energy correction; 

• uncertainty in the jet energy correction; 

• transverse momentum of the highest-px constituent in the jet; 

• pseudorapidity of the jet; 

• total number of jet constituents; 

• length and uncertainty of the displacement of the jet's secondary vertex; 

• mass and transverse momentum of the jet's secondary vertex; 

• number and fraction of jet constituents that are charged; 

• event energy density, p, calculated using constituents with |f/| < 2.5; 

• missing transverse energy in the event; 

• azimuthal angle between the missing transverse energy vector and the direction of 
the nearest jet in pseudorapidity. 

To better discriminate the signal from background for different Higgs boson mass hypotheses, 
an event classification BDT algorithm is trained separately for each mass value using simu- 
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Table 10: Selection criteria for the simulated event samples used in training of the signal and 
background BDT algorithm. Variables marked "-" are not used in the given channel. Entries 
in parentheses indicate the selection for the high-px(V) range. The second and third rows refer 
to the Pt threshold for the highest- and second-highest- px jet, respectively, for the pair with the 
highest total dijet transverse momentum, Pt(]])- The parameter Nai is the number of additional 
isolated leptons in the event. Kinematic variables are given in GeV and angles in radians. 



Variable 


W(£t/)H 


Z(£^)H 


Z(vv)H 






G [75 - 105] 




PtOi) 


> 30 


> 20 


> 80 


PtOi) 


> 30 


> 20 


> 30 


piOi) 


> 120 




G [120-160] (> 160) 




< 250 


G [80 - 150] (-) 


< 250 


Pt(V) 


G [120-170] (> 170) 


G [50-100] (> 100) 




CSVmax 


> 0.40 


0.50 (0.244) 


> 0.679 


CSVjfiin 


> 0.40 


0.244 


> 0.244 




= 




= 


^miss 


> 35(e) 




G [120-160] (> 160) 








> 0.5 


A(/)(V,H) 






> 2.0 



Table 11: Variables used for training the signal and background BDT algorithm. 



Variable definition 

pXj transverse momentum of each b jet from the Higgs boson decay 

m(jj) dijet invariant mass 

pT(jj) dijet transverse momentum 

px (V) vector boson transverse momentum 

CSVmax value of CSV for the b-tagged jet with the largest CSV value 

CSVmin value of CSV for the b-tagged jet with the second largest CSV value 

A(/)(V,H) azimuthal angle between the vector boson (or vector) and the dijet direction 

I A/y (jj) I difference in rj between b jets from Higgs boson decay 

AK(ji, j2) distance in rj-(p between b jets from Higgs boson decay (not for Z(££)H) 

Naj number of additional jets 



A(^(E!p'**, j) azimuthal angle between £^'®® and the closest jet (only for Z(vv)H) 



lated samples of signal and background events that pass the selection criteria described above, 
together with the requirements listed in Table 10 The set of input variables used in training 
this BDT is chosen by iterative optimization from a larger number of potentially discriminating 



variables. Table 11 lists these variables. The number Naj of additional jets in an event counts 
jets that satisfy px > 20 GeV and \r]\ < 4.5 for W(^i/)H, px > 20 GeV and |//| < 2.5 for Z{U)H, 
or Px > 30 GeV and < 4.5 for Z(i/v)H. The output distribution of this BDT algorithm is 
fitted to search for events from Higgs boson production. Fitting this distribution, rather than 
simply counting events in a range of the distribution with a good signal-to-background ratio, 
as in Ref. [149| , improves the sensitivity of the analysis by approximately 20%. 
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Table 12: Definitions of the control regions for the simulated sample of Z+jets and tt back- 
grounds in the Z(£^)H channel. The same selection is used for the low- and high-px(V) ranges. 
The values of kinematical variables are in GeV. 



Variable Z+jets tt 

e [75 - 105] ^ [75 - 105] 

PT(ji) > 20 > 20 

pT(j2) > 20 > 20 

Pt(V) e [50 -100] G [50 -100] 

CSVmax > 0.244 > 0.244 

CSVmin > 0.244 > 0.244 



m(jj) i [80 - 150], < 250 j [80 - 150], < 250 



9.2 Background control regions 

Control regions are identified in the data and used to correct the estimated yields from the 
MC simulation for two of the important background processes: tt production and V+jets, orig- 
inating from light-flavour partons (u, d, s, or c quarks and gluons) or from heavy-flavour (b 
quarks). Simultaneous fits are then performed to the distributions of the discriminating vari- 
ables in the control regions to obtain scale factors by which the simulation yields are adjusted. 
This procedure is performed separately for each channel. For the Z(££)H and WH modes the 
scale factors derived for the electron and muon decay channels are combined. These scale fac- 
tors account not only for possible simulation cross-section discrepancies with the data, but also 
for potential differences in the selection efficiencies for the various physics object. Therefore, 
separate scale factors are used for each background process in the different channels. The un- 
certainties in the scale factor determination include a statistical uncertainty from the fits (owing 
to the finite size of the samples) and an associated systematic uncertainty. The latter is esti- 
mated by refitting the distributions in the control regions after applying estimates for sources 
of potential systematic shifts such as b-jet-tagging efficiency, jet energy scale, and jet energy 
resolution. 



Tables 12 -14 list the selection criteria used for the control regions in the Z(££)H , Z(vv)H and 



WH channels, respectively. Table 15 summarizes the fit results for all channels separately for 
the 7 TeV and 8 TeV data sets. The fit results are found to be robust and the fitted scale factors 
are consistent with the values from the previous analysis [|149| . 

9.3 Systematic uncertainties 

Sources of systematic uncertainty in the expected signal and background yields and distribu- 



tion shapes are listed in Table 16 The uncertainty in the integrated luminosity measurement 
is 2.2% for the 7TeV data [151J and 4.4% for the 8 TeV data [135J. Muon and electron trigger, 
reconstruction, and identification efficiencies are determined in data from samples of leptonic 
Z boson decays. The uncertainty in the yields due to the trigger efficiency is 2% per charged 
lepton and the uncertainty in the identification efficiency is also 2% per lepton. The parameters 
describing the Z(i/i/)H trigger efficiency turn-on curve are varied within their statistical uncer- 
tainties and for different assumptions on the methodology. A 2% systematic uncertainty in the 
yield is estimated. 

The jet energy scale is varied by ±1 standard deviation as a function of the jet px arid r], and the 
efficiency of the analysis selection is recomputed. A 2-3% yield variation is found, depending 
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Table 13: Definitions of the control regions for the simulated samples of V+jets and tt back- 
ground processes in the Z(i/v)H channel for the low- and high-px(V) regions. The values in 
parentheses are for the high-px(V) region. The labels LF and HF refer to light- and heavy- 
flavour jets. The parameter N^i is the number of additional isolated leptons in the event. The 
values for kinematical variables are in GeV. 



Variable Z+jets (LF) Z+jets (HF) tf W+jets (LF) W+jets (HF) 

PtOi) > 60(> 80) > 60(> 80) > 60(> 80) > 60(> 80) > 60(> 80) 

PT(j2) > 30 > 30 > 30 > 30 > 30 

PT(jj) > 120(> 160) > 120(> 160) > 120(> 160) > 120(> 160) > 120(> 160) 

CSVmax - >0.898 >0.898 - >0.898 

Naj - - 1 

Nai 1 1 1 

Ef'^'' £[120-160] (> 160) £[120-160] (> 160) £[120-160] (> 160) £[120-160] (> 160) £[120-160] (> 160) 

m(jj) - ^[90-150] ^[90-150] - ^[90-150] 



Table 14: Definitions of the control regions for the simulated samples of three background pro- 
cesses in the W(^i/)H channel for the low- and high-px(V) regions. The values in parentheses 
are used for the high-px(V) region. The labels LF and HF refer to light- and heavy-flavour jets. 
The parameter Nai is the number of additional isolated leptons in the event, and IMETsig is the 
ratio of the £^'®® value to its uncertainty [144J. The values for kinematical variables are in GeV. 
The symbols e and ji mean that the selection is used only for the W(ei/)H mode or W(f/v)H 
mode, respectively. 



Variable 


W+jets (LF) 


tt 


W+jets (HF) 


PtOi) 


>30 


>30 


>30 


PT(j2) 


>30 


>30 


>30 


PT(jj) 


>120 


>120 


>120 


px(V) 


e [120 - 170] (>170) 


e [120- 170] (>170) 


G [120 - 170] (>170) 


CSVmax 




>0.898 


>0.898 


Naj 


<2 


>1 


=0 


Nai 


=0 


=0 


=0 


£miss 


>35 (e) 


>35 (e) 


>35 (e) 


IVIETsig 


>2.0{^), >3.0(e) 






m(jj) 


<250 


<250 


^ [90 - 150] 
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Table 15: Data/MC scale factors for the control region in each Higgs boson production process 
with the 7TeV and 8TeV data sets in the low- and high-px(V) ranges. The uncertainties shown 
are statistical and systematic, respectively. The labels LF and HF refer to light- and heavy- 
flavour jets. 



Process 


WH 


WH 


Z(«)H 


Z(«)H 


Z(i/i')H 


Z{vv)H 


Low-pt(V) 


7TeV 


8TeV 


7TeV 


8TeV 


7TeV 


8TeV 


W+jets (LF) 


0.88 ± 0.01 ± 0.03 


0.97 ±0.01 ±0.03 






0.89 ± 0.01 ± 0.03 


0.91 ± 0.03 ± 0.03 


W+jets (HF) 


1.91 ±0.14 ±0.31 


2.05 ±0.21 ±0.33 






1.36 ±0.10 ±0.15 


1.63 ±0.29 ±0.14 


Z+jets (LF) 






1.11 ±0.03 ±0.11 


1.41 ± 0.03 ± 0.16 


0.87 ± 0.01 ± 0.03 


1.01 ±0.05 ±0.03 


Z+jets (HF) 






0.98 ± 0.05 ± 0.12 


1.04 ± 0.05 ± 0.20 


0.96 ± 0.02 ± 0.03 


1.00 ±0.10 ±0.04 


tf 


0.93 ± 0.02 ± 0.05 


1.12 ±0.01 ±0.06 


1.03 ±0.04 ±0.11 


1.06 ±0.03 ±0.11 


0.97 ± 0.02 ± 0.04 


1.02 ±0.03 ±0.03 


High-pT(V) 


7TeV 


8TeV 


7TeV 


8TeV 


7TeV 


8TeV 


W+jets (LF) 


0.79 ± 0.01 ± 0.02 


0.88 ± 0.01 ± 0.02 






0.78 ± 0.02 ± 0.03 


0.86 ± 0.03 ± 0.03 


W+jets (HF) 


1.49 ±0.14 ±0.19 


1.30 ±0.20 ±0.17 






1.48 ±0.15 ±0.20 


1.43 ±0.28 ±0.18 


Z+jets (LF) 






1.11 ±0.03 ±0.11 


1.41 ±0.03 ±0.16 


0.97 ± 0.02 ± 0.04 


1.01 ±0.04 ±0.04 


Z+jets (HF) 






0.98 ± 0.05 ± 0.12 


1.03 ± 0.05 ± 0.20 


1.08 ± 0.09 ± 0.06 


1.06 ±0.06 ±0.07 


tf 


0.84 ± 0.02 ± 0.03 


0.97 ±0.02 ±0.03 


1.03 ± 0.04 ± 0.11 


1.06 ±0.03 ±0.11 


0.97 ± 0.02 ± 0.04 


1.03 ±0.04 ±0.04 



on the particular decay channel and production process. The effect of the uncertainty in the jet 
energy resolution is evaluated by smearing the jet energies by the measured uncertainty, giving 
a 3-6% variation in yields. The uncertainties in the jet energy scale and resolution also affect 
the shape of the BDT output distribution. The impact of the jet energy scale uncertainty is de- 
termined by recomputing the BDT distribution after shifting the energy scale up and down by 
its uncertainty. Similarly, the impact of the jet energy resolution is determined by recomputing 
the BDT distribution after increasing or reducing the jet energy resolution. 

Data-to-simulation b-tagging-efficiency scale factors, measured in tt events and multijet events, 
are applied to the jets in signal and background events. The estimated systematic uncertainties 
in the b-tagging scale factors are: 6% per b tag, 12% per c tag, and 15% per mistagged jet 
(originating from gluons and light quarks) ||31| . These translate into yield uncertainties in the 3- 
15% range, depending on the channel and the production process. The shape of the BDT output 
distribution is also affected by the shape of the CSV distribution, and therefore recomputed 
according to the range of variations of the CSV distributions. 

The theoretical VH signal cross section is calculated to NNLO, and the systematic uncertainty 
is 4% II25II , including the effects of scale and PDF variations [|95ti99| . The analysis described in 
this paper is performed in the regime where the V and H have a significant boost in pj, and 
thus, potential differences in the pj spectrum of the V and H between the data and the MC 
simulation generators could introduce systematic effects in the estimates of the signal accep- 
tance and efficiency. Theoretical calculations are available that estimate the NLO electroweak 
(EW) ||83lll52l|l53l and NNLO QCD [84] corrections to VH production in the boosted regime. 
The estimated effect from electroweak corrections for a boost of ~150GeV are 5% for ZH and 
10% for WH. For the QCD correction, a 10% uncertainty is estimated for both ZH and WH, 
which includes effects due to additional jet activity from initial- and final-state radiation. The 
finite size of the signal MC simulation samples, after all selection criteria are applied, con- 
tributes an uncertainty of 1-5% in the various channels. 

The total uncertainty in the prediction of the background yields from estimates using data is 
approximately 10%. For the V+jets background, the differences in the BDT output distribution 
for events from the MadGraph and HERWIG++ MC simulation generators are considered. For 
the single-top-quark and diboson yield predictions, which are obtained solely from simulation, 
a 30% systematics uncertainty in the cross sections is used. 
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Table 16: Systematic uncertainties in the predicted signal and background yields from the 
sources listed. The ranges give the variations over the 7 and 8 TeV data sets, different search 
channels, specific processes, and Higgs boson mass hypotheses. The acronym EWK stands for 
electroweak. 



Source Range (%) 



Integrated luminosity 


2.2-4.4 


Lepton identification and trigger efficiency (per lepton) 


3 


Z(t/v)H triggers 


2 


Jet energy scale 


2-3 


Jet energy resolution 


3-6 


Missing transverse energy 


3 


b-tagging efficiency 


3-15 


Signal cross section (scale and PDF) 


4 


Signal cross section (px boost, EWK/ QCD) 


5-10/10 


Statistical precision of signal simulation 


1-5 


Backgrounds estimated from data 


10 


Backgrounds estimated from simulation 


30 



9.4 Results 

Maximum-likelihood fits are performed to the output distributions of the BDT algorithms, 
trained separately for each channel and each Higgs boson mass value hypothesis in the 110- 
135 GeV range. In the fit, the BDT shapes and normalizations, for signal and each background 
component, are allowed to vary within the systematic and statistical uncertainties described in 



Section 9.3 These uncertainties are treated as nuisance parameters, with appropriate correla- 



tions taken into account. 



Tables 17-20 summarize the expected signal and background yields for both Pt(V) bins in each 



channel from the 7 TeV and 8 TeV data. All the data/MC scale factors determined in Section 9.2 



have been applied to the corresponding background yields. Examples of output BDT distribu- 



tions, for the jwh = 125 GeV training and for the high Pt(V) bin, are shown in Figure 30 The 
signal and background shapes and normalizations are those returned by the fits. Figure 30 also 
shows the dijet invariant-mass distribution for the combination of all five channels in the com- 
bined 7 and 8 TeV data sets, using an event selection that is more restrictive than the one used 
in the BDT analysis and that is more suitable for a counting experiment in just this observable. 
The events considered are those in the high Pt(V) bin with tighter b-tagging requirements on 
both jets, and with requirements that there be no additional jets in the events and that the az- 
imuthal opening angle between the dijet system and the reconstructed vector boson be large. 
The H — ?► bb search with such a selection is significantly less sensitive than the search using the 
BDT discriminant and it is therefore not elaborated on further in this article. 

The interpretation of the results from the BDT discriminant analysis, in terms of upper limits 
on the Higgs boson production cross section, is given in Section 10. 



10 Combined results 

In this section, we present the results obtained by combining the measurements from all five 
search channels described above. We begin with a short summary of the statistical method 
used to combine the analyses. 
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Table 17: Predicted signal and background yields (statistical imcertainty only) in the BDT out- 
put distribution for the Iow-pt(V) range with the 7TeV data for each of the five channels. The 
labels LP and HP refer to light- and heavy-flavour jets. The numbers in parentheses refer to the 
Higgs boson mass hypothesis in GeV. 



Process 




Z(ee)H 


Z(i/i/)H 


W(f/1/)H 


W(ei')H 


Z+jets (LP) 


176 ± 14 


255 ± 18 


158.3 ± 6.1 


11.0 ±1.5 


1.87 ±0.56 


Z+jets (HP) 


235 ± 16 


225 ± 16 


254.9 ±5.5 


23.2 ± 2.1 


2.71 ± 0.68 


W+jets (LP) 


- 


- 


133.1 ± 8.1 


124.6 ± 4.6 


58.5 ±3.1 


W+jets (HP) 


- 


- 


171.85 ± 7.1 


248.3 ± 9.5 


135.3 ± 7.0 


tt 


74.2 ± 1.9 


64.3 ±1.7 


898.5 ± 5.2 


894.6 ± 4.1 


575.5 ± 3.3 


Single Top 


3.73 ± 0.72 


2.67 ±0.64 


98.5 ± 5.9 


123.1 ± 3.0 


67.7 ±2.2 


W 


10.77 ±0.53 


10.07 ±0.55 


33.5 ± 1.5 


15.10 ± 0.72 


7.89 ± 0.54 


ZH(llO) 


2.72 ± 0.03 


2.19 ± 0.03 


6.19 ±0.05 


0.28 ± 0.02 


0.08 ±0.01 


WH(no) 






3.19 ±0.04 


4.98 ± 0.08 


2.96 ± 0.06 


ZH(n5) 


2.34 ± 0.03 


1.88 ± 0.03 


4.52 ± 0.05 


0.21 ± 0.01 


0.07 ±0.01 


WH(115) 






2.36 ± 0.03 


4.57 ± 0.07 


2.58 ± 0.05 


ZH(120) 


1.93 ± 0.02 


1.56 ± 0.02 


4.10 ± 0.04 


0.19 ± 0.01 


0.07 ± 0.01 


WH(120) 






2.15 ± 0.04 


3.90 ± 0.05 


2.17 ±0.04 


ZH(125) 


1.52 ±0.02 


1.23 ±0.02 


3.67 ±0.04 


0.18 ±0.01 


0.06 ±0.01 


WH(125) 






1.94 ± 0.04 


3.19 ± 0.04 


1.90 ±0.03 


ZH(130) 


1.19 ±0.01 


0.95 ± 0.01 


2.81 ± 0.04 


0.15 ± 0.01 


0.05 ± 0.01 


WH(130) 






1.25 ± 0.03 


2.56 ± 0.04 


1.50 ± 0.03 


ZH(135) 


0.83 ± 0.01 


0.67 ±0.01 


2.10 ±0.02 


0.11 ±0.01 


0.03 ± 0.01 


WH(135) 






0.87 ±0.02 


1.92 ±0.02 


1.13 ±0.02 


Sum 


500 ±22 


558 ± 24 


1749 ± 16 


1440 ± 12 


850 ±9 


Data 


493 


512 


1793 


1411 


925 



Table 18: Predicted signal and background yields (statistical uncertainty only) in the BDT out- 
put distribution for the high-px(V) range with the 7TeV data for each of the five channels. The 
labels LP and HP refer to light- and heavy-flavour jets. The numbers in parentheses refer to the 
Higgs boson mass hypothesis in GeV. 



Process 




Z(ee)H 


Z(i/y)H 


W(//i/)H 


W(ei/)H 


Z+jets (LP) 


291 ± 15 


275 ± 15 


107.7 ±3.1 


3.47 ± 0.79 


1.63 ± 0.52 


Z+jets (HP) 


180 ± 11 


160 ± 10 


117.0 ±4.6 


6.7 ± 1.2 


2.13 ± 0.59 


W+jets (LP) 






81.4 ±3.8 


61.9 ± 3.0 


41.4 ±2.5 


W+jets (HP) 






171.7 ±5.9 


129.5 ± 6.1 


67.8 ± 4.4 


tt 


41.7 ±1.4 


39.4 ±1.3 


275.7 ± 3.0 


302.4 ± 2.3 


225.0 ± 2.0 


Single Top 


1.49 ±0.45 


3.44 ± 0.71 


37.9 ± 3.4 


60.8 ±2.1 


41.6 ± 1.7 


W 


14.02 ± 0.67 


11.68 ±0.60 


24.6 ± 2.8 


9.71 ± 0.58 


6.28 ± 0.47 


ZH(llO) 


3.19 ± 0.04 


2.69 ± 0.03 


5.75 ± 0.04 


0.14 ±0.01 


0.07 ±0.01 


WH(llO) 






1.88 ± 0.06 


4.39 ± 0.07 


3.18 ± 0.06 


ZH(115) 


2.78 ± 0.03 


2.37 ±0.027 


5.87 ±0.05 


0.08 ±0.01 


0.04 ±0.01 


WH(115) 






1.71 ± 0.05 


3.93 ± 0.06 


2.82 ± 0.05 


ZH(120) 


2.41 ± 0.02 


2.09 ± 0.023 


5.15 ± 0.04 


0.10 ±0.01 


0.06 ± 0.01 


WH(120) 






1.42 ± 0.04 


3.57 ±0.05 


2.51 ± 0.04 


ZH(125) 


1.99 ± 0.02 


1.67 ±0.02 


4.46 ± 0.04 


0.08 ± 0.01 


0.04 ±0.01 


WH(125) 






1.15 ± 0.03 


3.04 ± 0.04 


2.14 ± 0.04 


ZH(130) 


1.58 ± 0.02 


1.37 ±0.01 


3.54 ± 0.03 


0.06 ± 0.01 


0.04 ± 0.01 


WH(130) 






0.70 ± 0.02 


2.51 ± 0.04 


1.83 ±0.03 


ZH(135) 


1.24 ± 0.01 


1.03 ± 0.01 


2.76 ± 0.02 


0.05 ± 0.01 


0.03 ± 0.01 


WH(135) 






0.77 ±0.02 


1.94 ± 0.03 


1.39 ± 0.02 


Sum 


529 ± 19 


490 ± 18 


816 ± 10 


575 ±6 


386 ±6 


Data 


565 


491 


783 


551 


383 
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10 Combined results 



Table 19: Predicted signal and background yields (statistical uncertainty only) in the BDT out- 
put distribution for the Iow-pt(V) range with the 8 TeV data for each of the five channels. The 
labels LF and HF refer to light- and heavy-flavour jets. The numbers in parentheses refer to the 
Higgs boson mass hypothesis in GeV. 



Process 




Z(ee)H 


Z{vv)H 


W(/Yi/)H 


W(ei/)H 


Z+jets (LF) 


296 ±20 


254 ±23 


156.3 ± 2.6 


13.7 ±2.5 


6.7 ±1.9 


Z+jets (HV) 


250 ± 15 


228 ± 17 


355.1 ± 4.7 


21.7 ±2.9 


8.5 ±2.0 


W+jets (LF) 


- 


- 


202.6 ± 3.1 


92.8 ± 6.9 


58.2 ± 5.7 


W+jets (HV) 


- 


- 


384.6 ± 5.0 


177.7 ± 14.1 


102.4 ±10.7 


tt 


86.3 ± 3.7 


75.7 ± 3.6 


1573 ± 29 


1308 ± 15 


970 ± 13 


Single Top 


5.4 ±1.9 


2.45 ± 0.82 


102.2 ± 2.2 


64.3 ± 5.2 


49.6 ±4.8 


W 


13.7 ±1.1 


12.4 ± 1.0 


48.3 ± 2.5 


19.0 ±1.8 


13.0 ± 1.8 


ZH(llO) 


2.83 ± 0.06 


2.21 ± 0.05 


7.78 ± 0.02 


0.31 ± 0.02 


0.14 ±0.01 


WH(llO) 






1.14 ±0.02 


4.87 ±0.17 


3.39 ± 0.14 


ZH(115) 


2.37 ±0.05 


1.89 ± 0.04 


6.64 ± 0.02 


0.25 ± 0.01 


0.11 ±0.01 


WH(115) 






1.11 ±0.02 


4.73 ± 0.15 


3.28 ± 0.13 


ZH(120) 


1.92 ± 0.04 


1.54 ±0.03 


5.78 ± 0.04 


0.23 ± 0.01 


0.10 ± 0.01 


WH(120) 






1.07 ±0.02 


3.79 ± 0.12 


2.59 ± 0.10 


ZH(125) 


1.52 ±0.03 


1.24 ±0.03 


4.39 ± 0.02 


0.18 ±0.01 


0.08 ± 0.01 


WH(125) 






0.95 ± 0.03 


3.19 ±0.10 


2.41 ± 0.09 


ZH(130) 


1.15 ± 0.02 


0.92 ± 0.02 


3.37 ± 0.04 


0.15 ± 0.01 


0.05 ± 0.01 


WH(130) 






0.79 ± 0.03 


2.61 ± 0.09 


1.85 ± 0.08 


ZH(135) 


0.83 ± 0.02 


0.65 ± 0.02 


2.31 ± 0.03 


0.11 ±0.01 


0.04 ± 0.01 


WH(135) 






0.61 ± 0.02 


1.85 ±0.06 


1.40 ±0.05 


Sum 


651 ±26 


572 ± 29 


2822 ± 30 


1697 ± 22 


1208 ± 19 


Data 


707 


547 


2804 


1727 


1289 



Table 20: Predicted signal and background yields (statistical uncertainty only) in the BDT out- 
put distribution for the high-px(V) range with the 8 TeV data for each of the five channels. The 
labels LF and HF refer to light- and heavy-flavour jets. The numbers in parentheses refer to the 
Higgs boson mass hypothesis in GeV. 



Process 




Z(ee)H 


Z(i/i/)H 


W(^i/)H 


W(ei/)H 


Z+jets (LF) 


426 ± 17 


353 ± 16 


109.6 ± 3.0 


4.1 ±1.2 


1.33 ±0.41 


Z+jets (HF) 


238 ± 11 


199 ± 10 


182.0 ± 3.6 


6.3 ±1.4 


3.17 ±0.99 


W+jets (LF) 






79.0 ± 2.8 


42.8 ± 4.8 


32.2 ± 4.4 


W+jets (HV) 






97.4 ± 4.9 


64.4 ± 7.1 


45.7 ±5.9 


tt 


55.0 ±3.0 


48.0 ± 2.8 


488 ± 16 


458.8 ±8.1 


361.8 ±7.4 


Single Top 


4.5 ± 1.5 


5.9 ± 2.2 


43.2 ± 2.4 


35.6 ±4.4 


28.6 ±4.1 


VV 


16.5 ±1.3 


13.4 ±1.2 


34.8 ± 1.8 


16.1 ±1.7 


9.0 ±1.2 


ZH(llO) 


3.66 ± 0.06 


2.95 ± 0.06 


8.05 ± 0.07 


0.14 ± 0.01 


0.08 ± 0.01 


WH(llO) 






2.63 ± 0.06 


4.49 ± 0.16 


3.92 ± 0.16 


ZH(115) 


3.17 ±0.05 


2.64 ± 0.05 


6.81 ± 0.05 


0.12 ±0.01 


0.06 ± 0.01 


WH(115) 






1.52 ± 0.05 


4.30 ± 0.14 


3.52 ±0.13 


ZH(120) 


2.77 ±0.04 


2.26 ± 0.04 


5.81 ± 0.04 


0.10 ±0.01 


0.06 ± 0.01 


WH(120) 






1.00 ± 0.04 


3.86 ± 0.12 


3.09 ± 0.11 


ZH(125) 


2.31 ± 0.04 


1.84 ± 0.03 


5.40 ± 0.04 


0.09 ± 0.01 


0.06 ± 0.01 


WH(125) 






0.74 ± 0.03 


3.29 ± 0.10 


2.67 ±0.09 


ZH(130) 


1.84 ± 0.03 


1.53 ± 0.03 


3.99 ± 0.03 


0.07 ±0.01 


0.04 ± 0.01 


WH(130) 






0.70 ± 0.02 


2.56 ± 0.09 


2.07 ± 0.08 


ZH(135) 


1.39 ± 0.02 


1.16 ± 0.02 


2.80 ± 0.02 


0.06 ± 0.01 


0.03 ± 0.01 


WH(135) 






0.67 ±0.02 


2.00 ± 0.06 


1.76 ± 0.06 


Sum 


740 ± 20 


620 ± 19 


1034 ± 18 


628 ± 13 


482 ± 11 


Data 


776 


635 


1045 


689 


544 
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Figure 30: Example of BDT output distributions in the high pj(V) bin, after all the selection 
criteria have been applied, for Z(f/^)H (top left), Z{vv)H (top right), and W(et/)H (bottom 
left). Bottom right: the b-tagged dijet invariant-mass distribution from the combination of all 
VH channels for the combined 7 and 8 TeV data sets. Only events that pass a more restrictive 
selection are included (see text). For all figures the solid histograms show the signal and the 
various backgrounds, with the hatched region denoting the statistical uncertainties in the MC 
simulation. The data are represented by points with error bars. The VH signal is represented by 
a red line histogram. The ratio of the data to the sum of the expected background distributions 
is shown at the bottom of each figure. 
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10 Combined results 



10.1 Combination methodology 

Combining the Higgs boson search results requires a simultaneous analysis of the data selected 
by the individual decay modes, accounting for their correlations and for all the statistical and 
systematic uncertainties. The statistical methodology used in this combination was developed 
by the ATLAS and CMS Collaborations in the context of the LHC Higgs Combination Group. 
A description of the general methodology can be found in Refs. II201I107I . Results presented in 
this paper are obtained using asymptotic formulae from Ref. [154| and recent updates available 
in the RooStats package [155|. The Higgs boson mass is tested in steps accordant with the 
expected Higgs boson width and the experimental mass resolution [,107j| . 

10.1.1 Characterizing the absence of a signal: limits 

For the calculation of exclusion limits, we adopt the modified frequentist criterion CLg |156l 
I157L The chosen test statistic q, used to determine how signal- or background-like the data are, 
is based on a profile likelihood ratio. Systematic uncertainties are incorporated via nuisance 
parameters and are treated according to the frequentist paradigm, as described in Ref II107I . 
The profile likelihood ratio is defined as 



£(obs I // • s + 0, 0) 

where "obs" stands for the observed data; s stands for the number and distribution of signal 
events expected under the SM Higgs boson hypothesis; f/ is a signal-strength modifier, intro- 
duced to accommodate deviations from the SM Higgs boson predictions; h is the number and 
distribution of background events; }i ■ s -\-h\s the signal-plus-background hypothesis, with the 
expected SM signal event yields s multiplied by the signal-strength modifier }i; 6 are nuisance 
parameters describing the systematic uncertainties. The value 9pt maximizes the likelihood in 
the numerator for a given j-i, while fi and 6 define the point at which the likelihood reaches its 
global maximum. 

The ratio of the probabilities to observe a value of the test statistic at least as large as the one 
observed in data, under the signal+background {]i ■ s + b) and background-only (b) hy- 
potheses, 

P(fl„ > qf^\u-s + b) 

CUM = r r I , s < (11) 

is used as the criterion for excluding the presence of a signal at the 1 — a confidence level. 

A signal with a cross section a = }i- ac,yi is defined to be excluded at 95% CL if CLs(^) < 0.05. 
Here, t7sM stands for the SM Higgs boson cross section. 

10.1.2 Characterizing an excess of events: p-values and significance 

To quantify the presence of an excess of events beyond what is expected for the background, 
we use a test statistic: 



10.1 Combination methodology 
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where the likelihood in the numerator is for the background-only hypothesis. The local statis- 
tical significance Ziocai of a signal-like excess is computed from the probability po 

po = P(^o > I b), (13) 
henceforth referred to as the local p-value, using the one-sided Gaussian-tail convention: 

po= / -^exp{-x^/2)dx. (14) 
•^Ziocai v27r 

In the Higgs boson search, we scan over the Higgs boson mass hypotheses and find the value 
giving the minimum local p-value pj^ap which describes the probability of a background fluc- 
tuation for that particular Higgs boson mass hypothesis. The probability to find a fluctuation 
with a local p-value lower or equal to the observed pj^™] anywhere in the explored mass range 
is referred to as the global p-value, Pgiobai- 

Pglobal = P(PO<pSSll^)- (15) 

The fact that the global p-value can be significantly larger than pj^^j is often referred to as 
the "look-elsewhere effect" (LEE). The global significance (and global p-value) of an observed 
excess can be evaluated following the method described in Ref. I|158l , using: 

Pgiobai = pSSi + C ■ e-^i-/2. (16) 

The constant C is found by generating a set of pseudo-experiments and using it to evaluate 
the global p-value corresponding to the p^^^i value observed in the data. Pseudo-experiments 
are a simulated outcome of an experiment obtained by randomly varying the average expected 
event yields and their distributions according to a specified model of statistical and systematic 
uncertainties. For example, a Poisson distribution is used to model statistical variations, while 
a Gaussian distribution is used to describe the systematic uncertainties. 

10.1.3 Extracting signal-model parameters 

The values of a set of signal-model parameters a (the signal-strength modifier }i is one of them) 
are evaluated from a scan of the profile likelihood ratio c]{a): 

^(a) = -2 1n ^(°^^l^(^) + ^-9 . (17) 
^ C{ohs\s{a) + b, 9) 

The values of the parameters a and G that maximize the likelihood £(obs | s{a) + h, 9), are 
called the best-fit set. The 68% (95%) CL interval for a given signal-model parameter a, is 
evaluated from q{ai) = 1 (3.84), with all other unconstrained model parameters treated as 
nuisance parameters. The two-dimensional (2D) 68% (95%) CL contours for pairs of signal- 
model parameters fl„ Uj are derived from q{ai,aj) = 2.3 (6.0). Note that the boundaries of the 
2D confidence-level region projected onto either parameter axis are not identical to the one- 
dimensional (ID) confidence intervals for this parameter. 
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10 Combined results 



10.2 Exclusion limits on the SM Higgs boson 
10.2.1 Results of searches in the five decay modes 



Figures 31 and 32 show the 95% CL upper limits on the signal-strength modifier, }i = a I ctsm/ as 
a function of Wh for the five decay modes: 77, ZZ, WW, tt, and bb. The observed values are 
shown by the solid lines. The SM Higgs boson mass regions where the line is below a I uc^y^ = 
1 are excluded at 95% CL. The dashed lines indicate the median of the expected results for 
the background-only hypothesis. The dark and light bands indicate the ranges in which the 
observed results are expected to reside in 68% and 95% of the experiments, should multiple 
experiments be performed under the background-only hypothesis. The probabilities for an 
observation to lie above and below the 68% (95%) bands are each 16% (2.5%). 

In the H — > 77 analysis, the SM Higgs boson signal is searched for in a simultaneous statistical 
analysis of the diphoton invariant-mass distributions for the eleven exclusive event classes: five 
classes (four untagged and one VBF-tagged) for the 7 TeV data and six classes (four untagged 
and two VBF-tagged) for the 8 TeV data, as described in Section|5] Figure 31 shows the 95% CL 



upper limits on the Higgs boson production cross section obtained in (a) the baseline analysis 
and the two alternatives analyses: (b) the cut-based analysis and (c) the sideband analysis. The 



observed limits in the sideband analysis [Fig. 31 (lower right)] are not smooth because, when 



changing the mass hypothesis, the event class boundaries move as well. This is true for the 
±2% signal window and each sideband window. This leads to events moving in and out of the 
classes in a discrete manner. Figure |3l] (top) shows that the H — )■ 77 search has reached the sen- 
sitivity for excluding the SM Higgs boson at 95% CL in the mass range 110-144 GeV, while the 
observed data exclude it in the following three mass ranges: 113-122 GeV, 128-133 GeV, and 
138-149 GeV. All three diphoton analyses give observed exclusion limits near mn = 125 GeV 
that are much weaker than the expected for the background-only hypothesis, which implies 
a significant excess of events with diphoton masses around 125 GeV. The consistency of the 
results obtained with the three alternative approaches confirms the robustness of the measure- 
ment. 

In the H — ?► ZZ — ?► A£ analysis, the SM Higgs boson signal is searched for in a simultaneous 
statistical analysis of six 2D distributions of the four-lepton invariant mass and the matrix- 
element-based kinematic discriminant Ko, as described in Section |6] The six distributions cor- 



respond to the three lepton final states (4e, 4f/, lelpi) and the 7 and 8 TeV data sets. Figure 32 
(upper left) shows the 95% CL upper limits on the Higgs boson production cross section. The 
H — >• ZZ — > A£ search has reached the sensitivity for excluding the SM Higgs boson at 95% CL 
in the mass range 120-180 GeV, while the observed data exclude it in the following two mass 
ranges: 130-164 GeV and 170-180 GeV. The observed exclusion limits for OTh = 120-130 GeV 
are much weaker than the expected limits for the background-only hypothesis, suggesting a 
significant excess of four-lepton events in this mass range. As a cross-check, the statistical anal- 
ysis using only the m^^i distributions has been performed. The results are found to be consistent 
with the 2D analysis, although with less sensitivity. 

In the H — > WW — > iv£v analysis, the SM Higgs boson signal is searched for in a simultaneous 
statistical analysis of eleven exclusive final states: same-flavour (e+e^ and ji^fi^) dilepton 
events with and 1 jet for the 7 and 8 TeV data sets, different-flavour e^]A^ dilepton events with 
and 1 jet for the 7 and 8 TeV data sets, dilepton events in the VBF-tag category for the 7 TeV 
data set, and same-flavour and different-flavour dilepton events in the VBF-tag category for 
the 8 TeV data set. All analysis details can be found in Section |7| Figure |32] (upper right) shows 
the 95% CL upper limits on the Higgs boson production cross section. The H — ?► WW — ?► £v£v 
search has reached a sensitivity for excluding the SM Higgs boson at 95% CL in the mass range 
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CMS \ls = 7 TeV, L = 5.1 fb"^ ^ = 8 TeV, L = 5.3 fb"^ 
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Figure 31: The 95% CL upper limits on the production cross section of a Higgs boson expressed 
in units of the SM Higgs boson production cross section, u I crg^, as obtained in the H ^ 77 
search channel for (top) the baseline analysis, (lower left) the cut-based analysis, and (lower 
right) the sideband analysis. The solid lines represent the observed limits; the background- 
only hypotheses are represented by their median (dashed lines) and by their 68% (dark) and 
95% (light) CL bands. 



72 
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CMS \ls = 7TeV, L = 5.1 fb"^ \ls = 8 TeV, L = 5.3 fb"^ 



CMS = 7 TeV, L = 4.9 fb'^ ^ = 8 TeV, L ^ 5.1 fb'^ 



CO 

D 



c 
o 



O 

LO 
05 



D 

o 



O 

LO 



Observed 

Expected (68%) 

Expected (95%) 

Exp. forSMm =125 GeV 




120 130 140 150 160 170 180 

niH (GeV) 

CMS ^ ^ 7 TeV, L ^ 4.9 fb'^ i^s = 8 TeV, L = 5.1 fb'^ 



Observed 

Expected (68%) 

Expected (95%) 

Exp. for SMm =125 GeV 




0^ 
110 



110 120 130 140 150 160 

mn (GeV) 

CMS (s = 7 TeV, L = 5.0 fb'^ ^ = 8 TeV, L ^ 5.1 fb'^ 



CO 

D 
o 



O 
to 



Observed 
Expected (68%) 
Expected (95% 
Exp. forSMm„ = 125 GeV 



115 120 125 130 



135 140 145 
mn (GeV) 




130 135 
mn (GeV) 



Figure 32: The 95% CL upper limits on the production cross section of a Higgs boson expressed 
in units of the SM Higgs boson production cross section, a I (Jsm/ for the following search modes: 
(upper left) H — > ZZ ^ 4£, (upper right) H WW, (lower left) H tt, and (lower right) 
H — > bb. The solid lines represent the observed limits; the background-only hypotheses are 
represented by their median (dashed lines) and by the 68% and 95% CL bands. The signal- 
plus-background expectation (dotted lines) from a Higgs boson with mass wjh = 125 GeV is 
also shown for the final states with a poor mass resolution, WW, tt, and bb. 
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122-160 GeV (the higher-mass range is not discussed in this paper), while the observed data 
exclude it in the mass range 129-160 GeV. The observed exclusion limits are weaker than the 
expected ones for the background-only hypothesis in the entire mass range, suggesting an 
excess of events in data. However, given the mass resolution of about 20% in this channel, 
owing to the presence of the two undetectable neutrinos, a broad excess is observed across 



the mass range from 110 to about 130 GeV. The dotted line in Fig. 32 (b) indicates the median 
expected exclusion limits in the presence of a SM Higgs boson with a mass near 125 GeV. The 
observed limits in this charmel are consistent with the expectation for a SM Higgs boson of 
125 GeV. 

In the H ^ TT channel, the 0-, 1-jet, and VBF categories are used to set 95% CL upper limits on 
the Higgs boson production. The ditau system is reconstructed in four final states: eXh, f/Th, e}i, 
jiji, where the leptons come from t — > evv or t — > }ivv decays. The 0- and 1-jet categories are 
further split into two categories of low or high ditau transverse momentum. The 7 and 8 TeV 
data are treated independently giving a total of 40 ditau mass distributions. All analysis details 
can be found in Section |8] Figure 32 (lower left) shows the 95% CL upper limits on the Higgs 



boson production cross section in this channel. The H — ?► rr search has not yet reached the 
SM Higgs boson exclusion sensitivity; the expected limits on the signal event rates are 1.3-2.4 
times larger than the event rates expected for the SM Higgs boson in this channel. 

In the H — )■ bb analysis, five final states are considered: two b-tagged jets with E^^^^ (Z — )■ vv), 
e^e , fi^ji^ (Z i^i^), e + E^^^^, and }i + £^^^^(W —> iv). Each of these categories is further 
split into two categories of low or high bb transverse momentum. The 7 and 8 TeV data are 
treated independently giving a total of 20 BDT-output distributions. All analysis details can be 



found in Section |9] Figure 32 (lower right) shows the 95% upper CL limits on the Higgs boson 
production cross section in this channel. The H — )■ bb search has not yet reached the SM Higgs 
boson exclusion sensitivity; the expected limits on the signal event rates are 1.2-2.8 times larger 
than the event rates expected for the SM Higgs boson in this channel. 

10.2.2 Combined results 

The five individual search channels described above are combined into a single search for the 



SM Higgs boson. Figure 33 (left) shows the 95% CL upper limits on the signal-strength mod- 
ifier, }i = a I CTgM, as a function of wh. We exclude a SM Higgs boson at 95% CL in two mass 
ranges: 110-121.5 GeV and 128.0-145 GeV. 



The CLg value for the SM Higgs boson hypothesis as a function of its mass is shown in Fig. 33 
(right). The horizontal lines indicate CLg values of 0.05, 0.01, and 0.001. The mass regions where 
the observed CLg values are below these lines are excluded with the corresponding (1 — CLg) 
confidence levels of 95%, 99%, and 99.9%, respectively. The 95% CL exclusion range for the 
SM Higgs boson is identical to that shown in Fig. |33] (left), as both results are simply different 
representations of the same underlying information. At 99% CL, we exclude the SM Higgs 
boson in three mass ranges: 110.0-111.5 GeV, 113.5-121.0 GeV, and 128.5-145.0 GeV. 



Figure 33 (right) shows that, in the absence of a signal, we would expect to exclude the entire 
mn range of 110-145 GeV at the 99.9% CL or higher. In most of the Higgs boson mass range, 
the differences between the observed and expected limits are consistent since the observed 
limits are generally within the 68% or 95% bands of the expected limit values. However, in 
the range 121.5 < mn < 128.0 GeV, we observe an excess of events, making the observed 
limits considerably weaker than expected in the absence of the SM Higgs boson and, hence, 
not allowing the exclusion of the SM Higgs boson in this range. 
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10 Combined results 



10.3 Significance of the observed excess 
10.3.1 Results of searches in the H ^ 77 and H 



ZZ 4£ decay modes 



As presented in Section 10.2.1 the searches for the SM Higgs boson in the 77 and ZZ — ?► 
modes reveal a substantial excess of events with diphoton and four-lepton invariant masses 
near 125 GeV. 



Figure 34 shows the local p-value as a function of the SM Higgs boson mass in the 77 channel. 
The results are presented for the three analyses: (a) baseline analysis, and in the two alternative 



analyses: (b) cut-based analysis, and (c) sideband analysis. Figure 34 (top) shows about a 3cr 
excess near 125 GeV in both the 7 and 8 TeV data. The minimum local p-value po = 1.8 x 10"^, 
corresponding to a local maximum significance of A.lcr, occurs at a mass of 125.0 GeV for the 
combined 7 and 8 TeV data sets. The median expected significance for a SM Higgs boson of 
this mass is 2.7(7. In the asymptotic approximation, 68% (95%) of repeated experiments would 
give results within ±lc7 (±2t7) around the median expected significance. Therefore, the excess 
seen in data, even being larger than the expected median for a Higgs boson signal, is consistent 
with a SM Higgs boson with a probability of about 16%. The consistency of the results from 
the three analyses is a good check on the robustness of the measurement. 

The local p-value as a function of the Higgs boson mass Wh for the ZZ — > A£ channel is shown 
in Fig. 35 The minimum of the local p-value is at jwh = 125.5 GeV and corresponds to a local 
significance of 3.2(7. A local significance of 2.2(7 is found for a ID fit of the invariant mass 
without using the Ku discriminant. The median expected significance for a SM Higgs boson of 
this mass is 3.8(7 and 3.2(7 for the 2D and ID fits, respectively. 
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Figure 33: The 95% CL upper limits on the production cross section of a Higgs boson expressed 
in units of the SM Higgs boson production cross section, a/ asm, (left) and the CLg values (right) 
for the SM Higgs boson hypothesis, as a function of the Higgs boson mass for the five decay 
modes and the 7 and 8 TeV data sample combined. The solid lines represent the observed limits; 
the background-only hypotheses are represented by their median (dashed lines) and by the 68% 
and 95% CL bands. The three horizontal lines on the right plot show the CLg values 0.05, 0.01, 
and 0.001, corresponding to 95%, 99%, and 99.9% confidence levels, defined as (1 — CLg). 
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Figure 34: The local p-value as a function of for the 7 and 8 TeV data sets and their com- 
bination for the 77 mode from (top) the primary analysis, (lower left) the cut-based analysis, 
and (lower right) the side-band analysis. The observed p-values for the combined 7 and 8 TeV 
data sets are shown by the solid lines; the median expected p-values for a SM Higgs boson with 
mass wjh, are shown by the dashed lines. The horizontal lines show the relationship between 
the p-value (left y axis) and the significance in standard deviations (right y axis). 
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Figure 35: The local p-value as a function of nin for the 7 and 8 TeV data sets and their combi- 
nation for the ZZ — ?► A£ channel. The observed p-values for the combined 7 and 8 TeV data sets 
are shown by the solid line; the median expected p-values for a SM Higgs boson with mass nin 
are shown by the dashed line. The observed p-values for the 7 and 8 TeV data sets are shown 
by the dotted lines. The horizontal lines show the relationship between the p-value (left y axis) 
and the significance in standard deviations (right y axis). 



10.3.2 Combined results 

To quantify the inconsistency of the observed excesses with the background-only hypothesis. 



we show in Fig. 36 (left) the local p-value po for the five decay modes combined for the 7 and 
8 TeV data sets. The 7 and 8 TeV data sets exhibit excesses of 3.2(t and 3.8cr, respectively for a 
SM Higgs boson with a mass near 125 GeV. In the combination, the minimum local p-value of 
Pmin = 3 X 10^^, corresponding to a local significance of 5.0t7, occurs at nin = 125.5 GeV. 



Figure 36 (right) gives the p-value distribution for each of the decay channels. The largest 
contributions to the overall excess are from the 77 and ZZ — )■ Ai channels. Both channels 
have good mass resolution and allow a precise measurement of the mass of the resonance 



corresponding to the excess. Their combined significance is 5.0a, as displayed in Fig. 37 (left). 
Figure |37| (right) shows the combined p-value distribution for the channels with poorer mass 
resolution: WW, tt, and bb. 



Table 21 summarizes the median expected and observed local significance for a SM Higgs bo- 
son mass hypothesis of 125.5 GeV from the individual decay modes and their combinations. In 
the TT channel, we do not observe an excess of events at this mass. The expected significance 
is evaluated assuming the expected background and signal rates. The observed significance is 
expected to be within ±lc of the expected significance with a 68% probability. 

The LEE-corrected significance is evaluated by generating 10 000 pseudo-experiments. After 
fitting for the constant C in Eq. ( [l6] l, we find that the global significance of the signal at niu = 
125.5 GeV is 4.60" (4.50") for the mass search range 115-130 GeV (110-145 GeV). 

The low probability for an excess at least as large as the observed one to arise from a statistical 
fluctuation of the background leads to the conclusion that we observe a new particle with a 
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Table 21: The median expected and observed significances of the excesses in the individual de- 
cay modes and their various combinations for a SM Higgs boson mass hypothesis of 125.5 GeV. 
There is no observed excess in the tt channel. 



Decay mode or combination 


Expected (a) 


Observed (a) 


ZZ 


3.8 


3.2 


77 


2.8 


4.1 


WW 


2.5 


1.6 


bb 


1.9 


0.7 


TT 


1.4 




77 + ZZ 


4.7 


5.0 


WW + TT + bb 


3.4 


1.6 


77 + ZZ + WW + TT + bb 


5.8 


5.0 




niH (GeV) (GeV) 



Figure 36: (Left) The observed local p-value for the combination of all five decay modes with 
the 7 and 8 TeV data sets, and their combination as a function of the Higgs boson mass. (Right) 
The observed local p-value for each separate decay mode and their combination, as a function 
of the Higgs boson mass. The dashed lines show the mean expected local p-values for a SM 
Higgs boson with mass mn. 
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Figure 37: The observed local p-value for the 77 and ZZ — )■ A£ decay channels with good 
mass resolution (left) and the WW, bb, and rr modes with poorer mass resolution (right), as 
a function of the Higgs boson mass for the 7 and 8 TeV data sets and their combination. The 
dashed lines show the expected local p-values for a SM Higgs boson with mass mn- 

mass near 125 GeV. The 77 and ZZ — ?► A£ decay modes indicate that the new particle is a 
boson, and the diphoton decay implies that its spin is different from 1 | |159[[r60 | | . 

10.4 Mass of the observed state 



To measure the mass of the observed state, we use the 77 and ZZ — > M decay modes. Figure 38 
(left) shows the 2D 68% CL regions for the signal cross section (normalized to the SM Higgs 
boson cross section) versus the new boson's mass nix, separately for untagged 77, VBF-tagged 
77, and ZZ — )■ 4£ events, and their combination. The combined 68% CL contour shown with a 



solid line in Fig. 38 (left) assumes that the relative event yields between the three channels are 
fixed to the SM expectations, while the overall signal strength is a free parameter. 

The energy scale uncertainties for photons, electrons, and muons are treated as independent. 
The Z — > ee peak is used for correcting both photon and electron energy scales. However, we 
find that they have a very weak correlation, since photons in H — ?> 77 decays and electrons 
in H — > ZZ — > A£ decays have substantially different energy scales. Moreover, the photons 
have an additional systematic uncertainty associated with the extrapolation of the energy scale 
corrections derived for the electrons to the energy scale corrections to be used for the photons. 

To measure the value of nix in a model-independent way, the untagged 77, VBF-tagged 77, 
and ZZ — > Ai channels are assumed to have independent signal cross sections. This is achieved 
by scaling the expected SM Higgs boson event yields in these channels by independent factors 
f/„ where i denotes the individual channel. The signal is assumed to be a particle with a unique 
mass nix- The mass and its uncertainty are extracted from a scan of the combined test statistic q, 
frequently referred to as —2 A In C, versus nix- The signal-strengths jij in such a scan are treated 



in the same way as the other nuisance parameters. Figure 38 (right) shows the test statistic as 



a function of nix for the three final states separately and their combination. The crossing of 
the q{mx) curves with the horizontal thick (thin) lines at q =1 (3.8) defines the 68% (95%) 
CL interval for the mass of the observed particle. These intervals include both the statistical 
and systematic uncertainties. The resulting mass measurement and 68% CL interval in such a 
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Figure 38: (Left) The 2D 68% CL contours for a hypothesized boson mass nix versus ]i = a I crg^ 
for the untagged 77, VBF-tagged 77, and ZZ — )■ 4^ decay channels, and their combination from 
the combined 7 and 8 TeV data. In the combination, the relative signal strengths for the three 
final states are fixed to those for the SM Higgs boson. (Right) The maximum-likelihood test 
statistic c\ versus mx for the untagged 77, VBF-tagged 77, and ZZ — > 4£ final states, and their 
combination from the combined 7 and 8 TeV data. Neither the absolute nor the relative signal 
strengths for the three final states are constrained to the SM Higgs boson expectations. The 
crossings with the thick (thin) horizontal line q = 1 (3.8) define the 68% (95%) CL interval for 
the measured mass, shown by the vertical lines. 
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combination is mx = 125.3 ± 0.6 GeV. 

To determine the statistical component in the overall uncertainty, we evaluate the test statistic 
C]{mx) with all the nuisance parameters fixed to their best-fit values. The result is shown by the 
dashed line in Fig. 39 The crossing of the dashed line with the thick horizontal line =1 gives 
the statistical uncertainty (68% CL interval) in the mass measurements: ±0.4 GeV. The quadra- 
ture difference between the overall and statistical-only uncertainties determines the systematic 
uncertainty component in the mass measurements: ±0.5 GeV. Therefore, the final result for the 
mass measurement is mx = 125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV. 
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Figure 39: The maximum-likelihood test statistic q versus the hypothesized boson mass nix for 
the combination of the 77 and ZZ — ?► A£ modes from the combined 7 and 8 TeV data. The solid 
line is obtained including all the nuisance parameters and, hence, includes both the statistical 
and systematic uncertainties. The dashed line is found with all nuisance parameters fixed to 
their best-fit values and, hence, represents the statistical uncertainties only. The crossings with 
the thick (thin) horizontal line q = 1 (3.8) define the 68% (95%) CL interval for the measured 
mass, shown by the vertical lines. 



10.5 Consistency of the observed state with the SM Higgs boson hypothesis 

The p-value characterizes the probability of the background producing the observed excess of 
events or greater, but it does not give information about the consistency of the observed excess 
with the expected signal. The current data sample allows for only a limited number of such 
consistency tests, which we present in this section. These consistency tests do not constitute 
measurements of any physics parameters per se, but rather show the consistency of the vari- 
ous observations with the expectations for the SM Higgs boson. Unless stated otherwise, all 
consistency tests presented in this section are for the hypothesis of the SM Higgs boson with 
mass 125.5 GeV and all quoted uncertainties include both the statistical and systematic ones. 

10.5.1 Measurement of the signal strength 

The value for the signal-strength modifier fl = a/ ctsm/ obtained by combining all the search 
channels, provides the first consistency test. Note that p becomes negative if the observed 
number of events is smaller than the expected rate for the background-only hypothesis. Fig- 



ure 40 shows the fi value versus the hypothesized Higgs boson mass mn- The band corresponds 
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to the 68% CL region when including the statistical and systematic uncertainties. The value of 
ji is found in 0.5 GeV steps of niu. The measured p value for a Higgs boson mass of 125.5 GeV 
is 0.87 ± 0.23, consistent with the value }i = 1 expected for the SM Higgs boson. 
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Figure 40: The signal-strength fl = cr/ asu as a function of the hypothesized SM Higgs bo- 
son mass niii using all the decay modes and the combined 7 and 8 TeV data sets. The bands 
correspond to ±1 standard deviation including both statistical and systematic uncertainties. 

Figure |4T] shows a consistency test of the p values obtained in different combinations of search 
channels. The combinations are organized by decay mode and additional features that allow 
the selection of events with an enriched purity of a particular production mechanism. The ex- 
pected purities of different combinations are discussed in the sections describing the individual 
analyses. For example, assuming the SM Higgs boson cross sections, the channels with the VBF 
dijet requirements have a substantial fraction (20-50%) of gluon-gluon fusion events. There is 
consistency among all the channels contributing to the overall measurement and their various 
combinations. 

The four main Higgs boson production mechanisms can be associated with either top-quark 
couplings (gluon-gluon fusion and ttH) or vector-boson couplings (VBF and VH). Therefore, 
combinations of channels associated with a particular decay mode and explicitly targeting dif- 
ferent production mechanisms can be used to test the relative strengths of the couplings of the 
new state to the vector bosons and top quark. Figure 42 shows the 68% and 95% CL contours 
for the signal-strength modifiers f<ggH+ttH of the gluon-gluon fusion plus ttH, and f/vBF+VH 
of the VBF plus VH production mechanisms. The three sets of contours correspond to the 
channels associated with the 77, tt, and WW decay modes; searches in these decay modes 
have subchannels with VBF dijet tags. The SM Higgs boson point shown by the diamond at 
?^ggH+ttH/ f<VBF+VH = (1/ 1) is within the 95% CL intervals for each of the three decay modes. 

10.5.2 Consistency of the data with the SM Higgs boson coupiings 

The event yield N of Higgs bosons produced in collisions of partons x {xx — ?► H) and decaying 
to particles y (H — > yy), is proportional to the partial and total Higgs boson decay widths as 
follows: 
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Figure 41: Signal-strength values fi = a/ tJgj^ for various combinations of the search channels 
with Wh = 125.5 GeV. The horizontal bars indicate the ztlc statistical-plus-systematic un- 
certainties. The vertical line with the band shows the combined }i value with its uncertainty. 
(Top) Combinations by decay mode and additional requirements that select events with an 
enriched purity of a particular production mechanism. (Bottom-left) Combinations by decay 
mode. (Bottom-right) Combinations by selecting events with additional requirements that se- 
lect events with an enriched purity of a particular production mechanism. 
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Figure 42: The 68% (solid lines) and 95% (dashed lines) CL contours for the signal strength 
of the gluon-gluon fusion plus ttH production mechanisms (f/ggH+ttn)/ versus VBF plus VH 
(Mvbf+vh)- The three different lines show the results for the decay modes: 77, WW, and tt. 
The markers indicate the best-fit values for each mode. The diamond at (1,1) indicates the 
expected values for the SM Higgs boson. 



r r 

N oc (7{xx H) ■ B{H yy) oc (18) 

Ftot 

where a{xx — )■ H) is the Higgs boson production cross section, S(H — )■ yy) is the branching 
fraction for the decay mode, Ixx and lyy are the partial widths associated with the H — > xx and 
H — > 1/1/ processes, and Ftot is the total width. 

Seven partial widths (Fww/ Fzz/ Ftt, F^b, ^tt, Fgg, F-y-y) and the total width Ftot are relevant for 
the current analysis, where Fgg is the partial width for the Higgs boson decay to two gluons. 
The partial widths Fgg and F-y-y are generated by loop diagrams and thus are directly sensitive 
to the presence of new physics. The possibility of Higgs boson decays to beyond-the-standard- 
model (BSM) particles, with a partial width Fbsm/ is accommodated by making Ftot equal to the 
sum of all partial widths of allowed decays to the SM particles plus Fbsm- 

The partial widths are proportional to the square of the effective Higgs boson couplings to the 
corresponding particles. To test for possible deviations of the measurements from the rates 
expected in different channels for the SM Higgs boson, we introduce different sets of coupling 
scale factors k and fit the data to these new parameters. One can introduce up to eight inde- 
pendent parameters relevant for the current analysis. Significant deviations of the scale factors 
from unity would imply new physics beyond the SM Higgs boson hypothesis. 

The current data set is insufficient to measure all eight independent parameters. Therefore, 
we measure different subsets, with the remaining unmeasured parameters either constrained 
to equal the SM Higgs boson expectations or included in the likelihood fit as unconstrained 
nuisance parameters. 

A. Test of custodial symmetry 

In the SM, the Higgs boson sector possesses a global SU(2)l x SU(2)r symmetry, which is bro- 
ken by the Higgs boson vacuum expectation value down to the diagonal subgroup SU(2)l+r. 
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As a result, the tree-level relations between the ratios of the W and Z boson masses, mw/ mz, 
and their couplings to the Higgs boson, g\N I gz, are protected against large radiative correc- 
tions, a phenomenon known as "custodial symmetry" l|161[|162| . However, large violations of 
custodial symmetry are possible in BSM theories. To test custodial symmetry, we introduce 
two scaling factors Kw and Kz that modify the SM Higgs boson couplings to W and Z bosons, 
and perform two different procedures to determine the consistency of the ratio Awz = ^v^/i<z 
with unity. 

The dominant Higgs boson production mechanism for the inclusive H — > ZZ and untagged 
H — > WW channels is gg — > H. Therefore, the ratio of the event yields for these channels pro- 
vides a test of custodial symmetry. To quantify the test, we introduce two event-rate modifiers 
}izz and Rwz- The expected H — > ZZ A£ event yield is scaled by }izz, while the expected 
untagged H — > WW — > iviv event yield is scaled by Rwz • fizz- The mass of the observed state 
is fixed to 125.5 GeV. The test statistic (|(Rwz) as a function of Rwz, with ^zz included with 



the other nuisance parameters, is shown in Fig. 43 (left) and yields Rwz = 0.9 ^J g, where the 
uncertainty is the combined statistical and systematic. The contributions from VBF and VH 
production to the fit give a small bias of 0.02 when relating the observed event-yield ratio Rwz 
to the square of the ratio of the couplings A^^^- Hence, the current measurements are consistent, 
within the uncertainties, with the expectation from custodial symmetry. 
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Figure 43: (Left) The likelihood test statistic q{Rwz) as a function of the event-rate modifier 
Rwz from the combined untagged H — )■ WW — >• iviv and inclusive H — > ZZ — > Ai searches. 
(Right) The test statistic i?(Awz) as a function of the ratio of the couplings to W and Z bosons, 
AwZ/ from the combination of all channels. The intersection of the curves with the horizontal 
lines q = 1 and 3.8 give the 68% and 95% CL intervals, respectively. 

In the second method, we extract Awz directly from the combination of all search channels. In 
this approach, we use three parameters: Awz/ k:z/ and Kf . The latter variable is a single event- 
rate modifier for all Higgs boson couplings to fermions. The BSM Higgs boson width Fbsm is set 
to zero. The partial width Fgg, induced by quark loops, scales as k^. The partial width T^^ is also 
induced via loop diagrams, with the W boson and top quark being the dominant contributors; 
hence, it scales as |aKw + jS^f where Kw = Awz ■ Kz and the ratio of the factors a and fi, 
jB/a —0.22, is taken from the prediction for the SM Higgs boson with mn = 125.5 GeV ||66|. 
In the evaluation of (?(Awz)/ both kz and Kf are included with the other nuisance parameters. 
Assuming a common scaling factor for all fermions makes this measurement model dependent. 
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but using all the channels gives it greater sensitivity. The results are shown in Fig. 43 (right) by 
the solid line. The dashed line indicates the median expected result for the SM Higgs boson, 
given the integrated luminosity. The measured value is Awz = l-l-o 3' where the uncertainty is 
the combined statistical and systematic. The result is consistent with the expectation of Awz = 1 
from custodial symmetry. In all further combinations presented below, we assume Awz = 1 
and use a common factor Ky to modify the Higgs boson couplings to W and Z bosons. 

B. Test of the couplings to vector bosons andfermions 

We further test the consistency of the measurements with the SM Higgs boson hypothesis by 
fitting for the two free parameters Ky and Kf introduced above. We assume Fbsm = 0/ i-e. no 
BSM Higgs boson decay modes. At lowest order, all partial widths, except for F-y-y, scale either 
as Ky or Kp. As discussed above, the partial width F-y^ scales as ]« + /S kf\^. Hence, 77 is the 
only channel sensitive to the relative sign of JCy and Kp. 



Figure 44 shows the 2D likelihood test statistic over the (jcy, Kf ) plane. The left plot allows for 
different signs of Ky and Kp, while the right plot constrains both of them to be positive. The 68%, 
95%, and 99.7% CL contours are shown by the solid, dashed, and dotted lines, respectively. The 
global minimum in the left plot occurs in the (+, — ) quadrant, which is due to the observed 
excess in the 77 channel. If the relative sign between Ky and Kp is negative, the interference 
term between the W and top-quark loops responsible for the H — > 77 decays becomes positive 
and helps boost the 77 branching fraction. However, the difference between the global mini- 
mum in the (+, — ) quadrant and the local minimum in the (+, +) quadrant is not statistically 
significant since the 95% CL contours encompass both of them. The data are consistent with 
the expectation for the SM Higgs boson: the point at (Ky, Kp) = (1, 1), shown by the diamond, is 
within the 95% CL contour. Any significant deviation from (Ky, Kp) = (1, 1) would imply BSM 
physics, with the magnitude and sign of the Ky and Kp measurements providing a clue to the 
most plausible BSM scenarios. 

Figure |45] displays the corresponding 68% and 95% contours of Ky versus Kp from each of the 
individual decay modes, restricting the parameters to the (+, +) and (+, — ) quadrants (left), 
and the (+, +) quadrant (right). The hypothesis of a "fermiophobic" Higgs boson that couples 
only to bosons is represented by the point at (1, 0). The point is just outside the 95% CL contour, 
which implies that a fermiophobic Higgs boson with = 125.5 GeV is excluded at 95% CL. 

The ID likelihood scans versus Ky and Kp, setting one parameter at a time to the SM value 
of 1, are given in the left and right plots of Fig. |46| respectively. The resulting fit values are: 
Ky = 1.00 ± 0.13 and Kp = 0.5 ± 0.2, where the uncertainties are combined statistical and 
systematic, with corresponding 95% CL intervals of [0.7; 1.3] and [0.2; 1.0], respectively. 

C. Test for the presence of BSM particles 

The presence of BSM particles can considerably modify the Higgs boson phenomenology, even 
if the underlying Higgs boson sector in the model remains unaltered. Processes induced by 
loop diagrams (H — >• 77 and gg — > H) can be particularly sensitive to the presence of new 
particles. Therefore, we combine and fit the data to the scale factors Kj and Kg for these two 
processes. The partial widths associated with the tree-level production processes and decay 
modes are assumed to be unaltered. 

Figure |47| displays the likelihood test statistic in the Kg versus Kj plane, under the assumption 
that Fbsm = 0. The results are consistent with the expectation for the SM Higgs boson of 
(K-y,Kg) = (1, 1). The best-fit value is (K-y,Kg) = (1.5, 0.75). 



Figure 48 gives the likelihood test statistic versus BRbsm = TBSivi/rtot/ with Kg and k^ included 
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Figure 44: The likelihood test statistic in the Ky versus Kf plane. The cross indicates the best- 
fit values. The solid, dashed, and dotted lines show the 68%, 95%, and 99.7% CL contours, 
respectively. The diamond shows the SM point (Ky, Kf ) = (1, 1). The left plot allows for different 
signs of Ky and Kf , while the right plot constrains them both to be positive. 



CMS tfs = 7 TeV, L = 5.1 fb-' (s = 8 TeV, L = 5.3 fb"' CMS = 7 TeV, L = 5.1 fb' /i = 8 TeV L = 5.3 tb' 




0.5 1 1.5 0.5 1 1.5 



Figure 45: The 68% CL contours for the test statistic in the (Ky versus Kf ) plane for individual 
channels (coloured regions) and the overall combination (solid thick lines). The thin dashed 
lines show the 95% CL range for the overall combination. The black cross indicates the global 
best-fit values. The diamond shows the SM Higgs boson point (Ky, Kf) = (1, 1). The point 
(Ky, Kf ) = (1, 0), indicated by the circle, corresponds to the fermiophobic Higgs boson scenario. 
The left plot allows for different signs of Ky and Kf , while the right plot constrains them both 
to be positive. 
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Figure 46: The likelihood test statistic q{Kv;KF = 1) (left) and q{KF;Kv = 1) (right). The in- 
tersections with the horizontal lines = 1 and q = 3.84 mark the 68% and 95% CL intervals, 
respectively, as shown by the vertical lines. 

as unconstrained nuisance parameters. The resulting 95% CL upper limit is BRbsm < 0-89. 
D. Test for differences in the couplings tofermions 



In two-Higgs-boson doublet models (2HDM) II163I , the couplings of the neutral Higgs bosons 
to fermions can be substantially modified with respect to the Yukawa couplings of the SM 
Higgs boson. For example, in the minimal supersymmetric model (MSSM), the couplings of the 
neutral Higgs bosons to up-type and down-type fermions are modified, with the modification 
being the same for all three generations and for quarks and leptons. In more general 2HDMs, 
leptons can be nearly decoupled from the Higgs boson that otherwise would behave like a SM 
Higgs boson with respect to the W and Z bosons and the quarks. To test for such modifications 
to the fermion couplings, we evaluate two different combinations of the corresponding param- 
eters: one in which we allow different ratios of couplings to the up- and down-type fermions 
(-^du = kj/Ku), and the other where we allow different ratios of the couplings to the leptons 
and quarks (A^q = K^/Kq). We assume that Fbsm = 0- 



Figure 49 (left) shows the resulting test statistic versus A^u/ with the other free coupling mod- 
ifiers, JCy and Ku, included as unconstrained nuisance parameters. The relative sign between 
the couplings to up- and down-type fermions is nearly degenerate, which manifests itself in 
the left-right symmetry observed in the plot. The symmetry is not perfect since there is some 
sensitivity to the sign of A^u because of the nonvanishing role of the b quark (in comparison to 
the top quark) in generating the Higgs boson coupling to gluons. Figure |49] (right) displays the 
corresponding results versus A/q, with the two coupling modifiers, Ky and Kq, treated as un- 
constrained nuisance parameters. There are no loop-induced processes measurably sensitive 
to the relative sign of the couplings to leptons and quarks; hence, the plot exhibits a perfect 
left-right symmetry. Both |Adu| and |A^q| are consistent with and 1, with a 95% CL upper 
limit of 1.5 for both. The main reason for both parameters having their best-fit values close 
to is the lack of any event excess in the H — )■ tt channel. However, neither the H — )■ rr 
nor the H — ?► bb channels have reached sufficient sensitivity to place strong constraints on the 
parameters associated with the corresponding Higgs boson couplings. 
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Figure 47: The likelihood test statistic (^(k.^. Kg) assuming Fbsm = 0- The cross indicates the best- 
fit values. The solid, dashed, and dotted contours show the 68%, 95%, and 99.7% CL contours, 
respectively. The diamond shows the SM point {K.y,Kg) = (1, 1). The partial widths associated 
with the tree-level production processes and decay modes are assumed to be unaltered (k = 1). 
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Figure 48: The likelihood test statistic q versus BRbsm = rBSM/Ttot/ with the parameters Kg and 
Kj included as nuisance parameters. The solid curve is the data; the dashed curve indicates 
the expected median results in the presence of the SM Higgs boson. The intersections with the 
horizontal lines q = 1 and 3.8 give the 68% and 95% CL intervals, respectively. The partial 
widths associated with the tree-level production processes and decay modes are assumed to be 
unaltered (k = 1). 
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Figure 49: (Left) Likelihood test statistic q as a function of the ratio A^^ of the coupling to 
the up- and down-type fermions with the coupling modifiers Ky and treated as nuisance 
parameters. (Right) The likelihood test statistic as a function of the ratio A^q of the couplings to 
leptons and quarks with the coupling modifiers Ky and Kq treated as nuisance parameters. The 
solid curves are the results from the data. The dashed curves show the expected distributions 
for the SM Higgs boson. The intersection of the curves with the horizontal lines q =1 and 3.8 
give the 68% and 95% CL intervals, respectively. 

11 Summary 

In this paper, the analyses that were the basis for the discovery of a new boson at a mass of 
approximately 125 GeV have been described in detail. The data were collected by the CMS 
experiment at the LHC in proton-proton collisions at a/s = 7 and 8 TeV, corresponding to 
integrated luminosities of up to 5.1 fb^^ and 5.3 fb^^, respectively. 

The particle is observed in the search for the SM Higgs boson using five decay modes 77, ZZ, 
WW, TT, and bb. An excess of events is found above the expected background, with a local 
significance of 5.0(7", signaling the production of a new particle. The expected significance for a 
SM Higgs boson of that mass is 5.8(7. 

The excess is most significant in the two decay modes with the best mass resolution, 77 and 
ZZ — > A£, and a fit to these invariant-mass peaks gives a mass of 125.3 ± 0.4 (stat.) ± 0.5 (syst.) GeV, 
The decay to two photons indicates that the new particle is a boson with spin different from 
one. Within the SM hypothesis, the couplings of the new particle to vector bosons, fermions, 
gluons, and photons have been measured. All the results are consistent, within their uncertain- 
ties, with expectations for a SM Higgs boson. More data are needed to ascertain whether the 
properties of this new state imply physics beyond the SM. 
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